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Journals unite for reproducibility 


Consensus on reporting principles aims to improve quality control in biomedical research and 


encourage public trust in science. 


tion are cornerstones of the scientific method. Of course, just 

because a result is reproducible does not make it right, and just 
because it is not reproducible does not make it wrong. A transparent and 
rigorous approach, however, will almost always shine a light on issues of 
reproducibility. This light ensures that science moves forward, through 
independent verifications as well as the course corrections that come 
from refutations and the objective examination of the resulting data. 

It was with the goal of strengthening such approaches in the 
biomedical sciences that a group of editors representing more than 
30 major journals; representatives from funding agencies; and scien- 
tific leaders assembled at the American Association for the Advance- 
ment of Science’s headquarters in June 2014 to discuss principles and 
guidelines for preclinical biomedical research. The gathering was 
convened by the US National Institutes of Health, Nature and Science 
(see Science 346, 679; 2014). 

The discussion ranged from what journals were already doing to 
address reproducibility — and the effectiveness of those measures 
— to the magnitude of the problem and the cost of solutions. The 
attendees agreed on a common set of Principles and Guidelines in 
Reporting Preclinical Research (see go.nature.com/ezjllp) that list 
proposed journal policies and author reporting requirements in order 
to promote transparency and reproducibility. 

The guidelines recommend that journals include in their informa- 
tion for authors their policies for statistical analysis and how they review 
the statistical accuracy of work under consideration. Any imposed page 
limits should not discourage reproducibility. The guidelines encourage 
using a checklist to ensure reporting of important experimental param- 
eters, such as standards used, number and type of replicates, statistics, 
method of randomization, whether experiments were blinded, how 


R eproducibility, rigour, transparency and independent verifica- 


the sample size was determined and what criteria were used to include 
or exclude any data. Journals should recommend deposition of data in 
public repositories, where available, and link data bidirectionally when 
the paper is published. Journals should strongly encourage, as appropri- 
ate, that all materials used in the experiment be shared with those who 
wish to replicate the experiment. Once a journal publishes a paper, it 
assumes the obligation to consider publication of a refutation of that 

paper, subject to its usual standards of quality. 


“The guidelines The more open-ended portion of the 
encourage using guidelines suggests that journals establish best 
achecklist to practices for dealing with image-based data 
ensure reporting (for example, screening for manipulation, 
of important storing full-resolution archival versions) and 
experimental for describing experiments in full. An exam- 


ple for animal experiments is to report the 
source, species, strain, sex, age, husbandry 
and inbred and strain characteristics for transgenic animals. For cell 
lines, one might report the source, authentication and mycoplasma 
contamination status. The existence of these guidelines does not obvi- 
ate the need for replication or independent verification of research 
results, but should make it easier to perform such replication. 

Some of the journals at the meeting had already had all or most of 
these principles and guidelines in place. But the point is that a large 
number of scientific journals are standing together in their convic- 
tion that reproducibility and transparency are important issues. As 
partners to the research enterprise in the communication and dis- 
semination of research results, we want to do our part to raise the 
standards for the benefit of scientists and of society. The hope is that 
these guidelines will be viewed not as onerous, but as part of the quality 
control that justifies the public trust in science. = 


parameters.” 


On the mend 


The scientific regeneration of central Europe is 
gathering pace, but needs further help to thrive. 


almost exactly 50 years after Nazi Germany’s assault on Poland 

triggered the Second World War, was perhaps the brightest 
moment in Europe’ twentieth-century history. The fall of the Berlin 
Wall restored political and personal freedom to central Europe, where 
people had endured Hitler’s atrocities only to find themselves ruled 
by Soviet despots. It is a small miracle that the rich learned tradition 
of the region survived two consecutive tyrannies. 


Ts peaceful implosion of communism in the autumn of 1989, 


Science in liberated central Europe had to adapt quickly to survive 
in the free world. Governments, intellectual elites and academic insti- 
tutions in the region were all equally unprepared for the political sea 
change that occurred after 1989. A quarter of a century on, the trans- 
formation to parliamentarian democracy and a market economy has 
been achieved. Science was generally nota priority in the early years of 
the transition. But from 2004 onwards, membership of the European 
Union (EU) provided a boon that some countries are prudently using 
to rebuild their research capacities (see page 22). However, despite gen- 
erous subsidies from Brussels, other countries have a long way to go. 

The region’s main asset is a growing pool of young talent that is 
rediscovering science as a worthwhile profession. This genera- 
tion rightly demands more support and more constructive politi- 
cal vision than some of the region’s current governments have 
to offer. In Romania and Bulgaria, where those in power are 
stubbornly obstructing reform, science is losing out. And in Hungary, 
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where pluralism is under threat, the writing is on the wall. 

The last generation of scientists who trained under the communist 
regime had little faith in science in their countries. When freedom 
arrived, most of them grasped vastly more lucrative business oppor- 
tunities or chose to migrate permanently to the West. The resulting 
shortage of mid-career scientists can be seen in almost any research 
department in the region. It is also the main reason why central Europe 
attracts shamefully few grants from the European Research Council 
and why the region is lagging behind in terms of its overall scientific 
output (see graphic on page 24). 

But interest in science and higher learning is rising sharply. In 
Poland, student numbers have increased fivefold since 1990. Overall, 
more than one-quarter of the 20 million students in the EU are now 
from the new member states. The vast majority of them were born 
after the demise of communism. 

Young scientists do continue to leave, and so they should, but the 
excessive brain drain has thankfully ceased. Most countries in the 
region have in recent years created transparent and strictly merit-based 
science funding systems. But alot more needs to be done — especially 
at universities — to help talented young scientists gain independence at 
an early age. Institutes that do employ young independent group lead- 
ers, such as the International Institute of Molecular and Cell Biology in 
Warsaw, are reaping the benefits. Furthermore, funding agencies across 
the region should expand, and better advertise, programmes aimed at 
repatriating young foreign-trained scientists. 

Universities in the region are a long way from scoring in the upper 
ranks of international academic comparisons. But the idea that 
central Europe remains a poor relation in global science is obsolete. 


Since 2004, numerous labs have been re-equipped to facilitate 
competitive science. The best institutes now offer conditions on a par 
with those at aspiring labs in Singapore, China or Saudi Arabia. But 
science managers in central Europe lack the bravado with which the 
new players in Asia and the Middle East trumpet their strengths and 

the aggressiveness with which they recruit foreign talent. 
The new European Commission that takes office this month must 
help the region to raise its scientific profile. 


“The idea that The EU’s €80-billion (US$100 billion) Hori- 
central Europe zon 2020 programme, which started this year, 
remains a poor includes a scheme that invites less-potent 
relation in member states to open new research centres, 
global science or upgrade existing ones, in partnership with 


richer countries. Leading research institu- 
tions in the West should accept the invitation. 

Collaborations involving high-profile British universities, say, or the 
prestigious German Max Planck institutes, would no doubt raise the 
visibility of central European science and help to improve the region’s 
participation in EU-funded research. Structural funds will also remain 
essential. Billions have already been earmarked for the 2014-20 period, 
and research is to remain a major beneficiary. But the commission 
should closely monitor the effectiveness of the investment. 

Science is at the heart of the EU’s policies. Its renaissance in many 
parts of central Europe serves as an example of successful European 
integration at a time when forces threatening the EU’s social and politi- 
cal cohesion are gaining strength. Domestic neglect of science in the 
continent's southeast risks casting the EU’s poorer countries even 
further adrift from the rest of Europe. = 


is obsolete.” 


Protect the parks 


Balancing the needs of development and 
conservation is difficult — but urgent. 


he World Parks Congress could not be in a better location this 
year than Sydney, Australia. The gathering of researchers, policy 
experts and conservationists occurs only once a decade, and it 
arrives in Australia next week as debate over protection of one of the 
most famous parks in the nation — and the world — reaches fever pitch. 

The Great Barrier Reef is celebrated as one of nature's true wonders. 
In the popular imagination it is the quintessential coral reef. Millions 
visit every year to swim in its waters, and millions more dream of such 
a trip. Itis rightly a source of pride for many Australians. 

But the reefis in trouble, beset by direct human activity in the form of 
coastal development and indirect activity in the form of climate change. 
It exemplifies the problems that congress attendees must struggle with. 

The International Union for Conservation of Nature — which over- 
sees the meeting — says that the 2003 parks congress in Durban, South 
Africa, led to major advances, including ongoing work on protected 
areas under the Convention on Biological Diversity, and revisions to 
the system used to manage protected areas. 

The world has changed since 2003, when political rhetoric around 
climate change was still gearing up. Levels of carbon dioxide in the 
atmosphere — measured at the US National Oceanic and Atmospheric 
Administration's Mauna Loa observatory in Hawaii — have increased 
from an average of 375.77 parts per million (p.p.m.) to current peaks of 
more than 400 p.p.m. on some days, the highest for thousands of years. 

It is true that there has been notable progress in creating protected 
areas, especially in the oceans: huge parks were established in the 
Pacific by US President George W. Bush and then expanded by Presi- 
dent Barack Obama. These join similar areas, including one around the 
Chagos Islands in the Indian Ocean, created by the United Kingdom. 
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Progress towards internationally agreed targets to protect 10% of the 
world’s oceans is slow and behind schedule, but it is progress nonethe- 
less. And growing environmental movements are making themselves 
heard in rapidly developing economies such as China. In this issue of 
Nature, seven researchers set out their vision of how the world’s parks 
should develop (page 28). This work is more important than ever. 

There have been setbacks. Extensive networks of parks across Africa 
have failed to safeguard rhinoceroses and elephants from surging poach- 
ing. Deforestation continues apace. And the Great Barrier Reef itself has 
been hard hit, with coral cover falling at a worrying rate (see page 16). 

Ecosystems are complex and ever-changing, and it is difficult to 
prove that protections make a difference. Well-funded and well-organ- 
ized business lobby groups seek to develop many important areas, and 
development often still trumps wildlife. People quite legitimately want 
to improve their own lot, even at the expense of other species. Wealthy 
nations are able to seal off certain areas for protection, but developing 
nations in which people rely on subsistence hunting and fishing may 
not have that luxury. 

How to measure the value of the world’s ecosystems is still debated. 
Edward Barbier argues on page 32 that economists tracking growth 
have done us a disservice by omitting from their calculations the 
value that has been lost by, for example, turning a mangrove forest 
into a shrimp farm. 

And conservationists still argue over why we should protect the planet 
— for its economic value or for its own sake. On page 27, Heather Tallis 
and Jane Lubchenco say that “vitriolic” battles over this tension endan- 
ger conservation science. “It is time to re-focus the field of conservation 
on advancing and sharing knowledge in all relevant disciplines and con- 
texts, and testing hypotheses based on observations, experiments and 
models,” they write in a petition with 238 other signatories. 

In conservation, it is often not clear when deadlines are until they have 
passed. At the next World Parks Congress, around 
2024, what will attendees discuss? Will there be 
any truly wild rhinos left? Will the Great Barrier 
Reef be in terminal decline? Hopefully not. But 
this year’s attendees have their work cut out. m 
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WORLD VIEW jernisicor sen 


what you saw? And then volunteer to identify the arrested suspect 

ina line-up? That sounds like the right response, but it might not 
be. Almost 25 years ago, a colleague and I published a psychology 
study (J. W. Schooler and T. Y. Engstler-Schooler Cogn. Psychol. 22, 
36-71; 1990) that indicated that to describe the physical appearance 
of the criminal could make it harder for a witness to subsequently 
identify the person they saw. 

The effect is called ‘verbal overshadowing’ and its discovery proved 
controversial, and not simply because of what it means for detectives. 
Over the years, other researchers (myself included) have had mixed 
success replicating the finding. Some have doubted whether it exists 
at all. That 1990 research project has been used as an example of the 
‘replication crisis’ that has engulfed science in 
recent years. In disciplines such as medicine, 
psychology, genetics and biology, researchers 
have been confronted with results that are not as 
robust as they originally seemed. 

In response, scientists have launched various 
replication projects to assess the robustness of 
published research. In psychology, numerous 
labs have volunteered to re-run studies, with the 
methods vetted by the original researchers. My 
discovery of the verbal-overshadowing effect was 
an obvious target for this approach, and so, last 
year, psychologists at 31 different laboratories 
across the world signed up to repeat the study 
and report the results. 

How did I feel about having work scrutinized in 
this way? I thought that I could not lose. Positive 
replication would confirm the important verbal- 
overshadowing effect. Failure to replicate would be more evidence for 
the ‘decline effect; an idea I endorse that the size of an effect decreases 
over repeated replications, for reasons that are not fully understood. 

Unfortunately for a replication study, there was a mistake in the 
timing parameters of the initial experimental protocol. Still, this unex- 
pected negative turn of events took a positive spin as the deviation 
from the original protocol, once identified and fixed, generated some 
useful comparative data. 

Some 22 of the original 31 repeating labs went on to follow the cor- 
rected study protocol. Pooled, the results confirmed the original finding. 
The verbal-overshadowing effect was clearly demonstrated (although 
the effect size was smaller than in our 1990 research and observed only 
when the original parameters, in the corrected protocol, were followed). 

The outcome is a genuine victory for the 


E you witness a bank robbery, what should you do? Tell the police 


emerging field of metascience, an approachin NATURE.COM 
which science turns the lens of scrutiny on itself. _ Discuss this article 
Metascience, the science of science, uses rigor- _ online at: 
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Metascience could rescue 
the ‘replication crisis’ 


Independent replication of studies before publication may reveal sources 
of unreliable results, says Jonathan W. Schooler. 


influence the validity of scientific conclusions. It has its roots in the 
philosophy of science and the study of scientific methods, but is 
distinguished from the former by a reliance on quantitative analysis, 
and from the latter by a broad focus on the general factors that con- 
tribute to the limitations and successes of research. 

Large-scale replication efforts such as this one are important, 
but they are expensive, time-consuming and impracticable for the 
vast majority of scientific studies. Rather than focus exclusively on 
whether past studies stand up, we need a clearer sense of the processes 
that influence the reliability of new findings. These could include how 
‘invested’ researchers are in the original hypothesis, the number of 
times a protocol is repeated and how the methods and outcomes are 
assessed and written up. 

Together with labs at three other universi- 
ties, my research group has begun an initiative 
to test the reproducibility of our science. Rather 
than re-examine published studies, each lab has 
agreed to allow the other three to generate new 
findings, replicate each others’ experiments 
and compare the results of new studies before 
they are published. We will then be able to judge 
differences, for example, in the results obtained 
by the originating lab compared with those 
that follow the idea, and so, in theory, have less 
invested in the results. 

It is clearly not feasible for all researchers to 
follow this approach in their routine work. But 
it should offer a valuable academic exercise to 
examine the factors that affect reproducibility as 
they arise during the course of research. 

By pre-registering all aspects of new scientific 
studies and then repeatedly trying to replicate them, the project allows 
careful scrutiny of all parts of the research process, from inception 
to replication. If the studies replicate flawlessly, we will have estab- 
lished a gold standard for reproducible studies. If they do not, then our 
approach will present an opportunity to rigorously assess the reasons. 

Some might suggest that the focus on replication within psychology 
is an indictment of the field. It is precisely the opposite. All fields face 
problems with reproducibility, and psychology should be applauded 
for its willingness to tackle the issue empirically. 

In fact, psychological science has long been at the forefront of 
refining and improving the scientific process. The understanding of 
experimenter expectancy effects (a form of cognitive bias) and the 
importance of double-blind trials emerged first in psychology. Such 
self-examination can only strengthen the scientific process for all. m 


Jonathan W. Schooler is a psychologist at the University of 
California, Santa Barbara. 
e-mail: jonathanwschooler@gmail.com 
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CLIMATE CHANGE 


Greenland’s ice at 
mercy of ocean 


As the climate warms, changes 

to the ocean around Greenland 
could cause its ice sheet to melt 
where it currently seems stable. 

Camille Lique and her 
colleagues at the University of 
Oxford, UK, used data from 
an integrated climate model 
to study a worst-case scenario: 
concentrations of atmospheric 
carbon dioxide that increase by 
2% each year for 70 years until 
they are four times the current 
level. The authors found that 
the resulting changes in ocean 
circulation will contribute 
to ocean warming across the 
entire region, by up to 5°C 
in places. This will lead to 
increased melting of marine- 
terminating glaciers across 
most of Greenland. 

The potential for widespread 
ice loss suggests that ice-sheet 
monitoring should not be 
limited to areas of current, 
rapid melting, the authors say. 
Clim. Dynam. http://doi.org/wsq 
(2014) 


Chimps plan for 
better breakfasts 


Wild chimpanzees plan their 
days to improve the chance of 
finding tasty fruit for breakfast. 
Chimps (Pan troglodytes 
verus) like ripened figs 
(pictured), but these treats 
are available only for short 
periods of time and are sought 
by other animals. To find out 
how chimps secure the prized 
fruit, Karline Janmaat and her 
colleagues at the Max Planck 
Institute for Evolutionary 
Anthropology in Leipzig, 


Germany, monitored five wild 
female chimps at Tai National 
Park in the Ivory Coast for 
275 days over 2 years. They 
found that when figs were ripe, 
the animals often left their 
bed nests before dawn, and 
departed earlier when the fig 
tree was farther away. 

Such flexible planning may 
have supported the evolution 
of calorie-hungry big brains 
in other primates and 
ancient human ancestors, the 
researchers say. 

Proc. Natl Acad. Sci. USA http:// 
doi.org/ws6 (2014) 


| __NEUROSCIENCE 
Nostalgia rewards 
the brain 


Reminiscing about happy 
times is rewarding to the brain, 
and people will even give up 
money for the chance to enjoy 
some nostalgia. 

Mauricio Delgado and 
his colleagues at Rutgers 
University in Newark, New 
Jersey, asked volunteers to 
recall happy and neutral 
memories while their brains 
were scanned using functional 
magnetic resonance imaging. 
Participants spent more time 
recalling happy memories, 
and when doing so, their brain 
activity patterns were similar to 
those seen in people receiving 
money. When offered a small 
amount of money to recall a 
positive memory anda larger 
amount for a neutral memory, 
the volunteers were more likely 
to choose the happy memory. 

The researchers say that 
recalling good memories could 
be useful for improving mood. 
Neuron http://doi.org/ws2 (2014) 


ATMOSPHERIC SCIENCE 


Warming from soot 
overestimated 


Atmospheric soot may 

not have nearly as much 
climate warming potential as 
previously thought. 

Tiny carbon particles 
produced by biomass burning 
and incomplete combustion 
of fossil fuels absorb sunlight, 
warming the planet. Xuan 


RESEARCH HIGHLIGHTS Mii Saiaa¢ 


SOCIAL SELECTION 


Popular articles 
on social media 


Conference gender gap revealed 


Conferences are a central part of scientific life, but they are also 
an arena for gender disparities, according to a study proving 
popular on social media. Researchers in Australia gathered 
data from the 2013 Australasian Evolution Society meeting 
and found that male speakers tend to get a bigger share of the 
exposure — a conclusion shared by past studies of conferences. 
Even though roughly the same number of men and women 
attended and presented at the evolution conference, women 
spoke for less time and were also less inclined to ask for longer 
slots for their talks. Katie Hinde, an evolutionary biologist at 
Harvard University in Cambridge, Massachusetts, shared her 
take-home message on Twitter: “Ladies, request the long talk” 


PeerJ 2, e627 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 


Nature Publishing Group. 


Wang at the Massachusetts 
Institute of Technology in 
Cambridge and his colleagues 
used a refined chemical 
transport model, along with 
actual observations, to study 
black-carbon behaviour. 


They found that previous 


simulations have substantially 
overestimated concentrations 
of soot in remote regions 

as well as its global lifetime, 
leading to higher estimates 

of its warming potential. The 
authors conclude that the 
direct warming effect of black 
carbon might be less than 
one-quarter of the previously 
reported value. 


Policies aimed at reducing 


black-carbon emissions could 
have only a limited impact on 
mitigating climate warming, 
the team cautions. 

Atmos. Chem. Phys. 14, 
10989-11010 (2014) 


Vibrations yield 
new type of bond 


Calculations suggest that a 
new kind of chemical bond 
proposed in the 1980s might 
have occurred in a 2012 
experiment that coupled two 
bromine atoms to an exotic 


> NATURE.COM 
For more on 
popular papers: 
go.nature.com/qcdltf 


form of hydrogen. 

The molecule 
BrHBr is held together by 
weak electrostatic attractions 
known as Van der Waals’ 
forces. Jorn Manz at Shanxi 
University in Taiyuan, 
China, and his colleagues 
calculated what would 
happen if the hydrogen 
were swapped for a lighter 
isotope called muonium, in 
which a positively charged 
elementary particle called an 
antimuon takes the place of 
the proton. 

They predict that the 
BrMuBr molecule would 
be held together not by 
electrostatic forces but with 
a vibrational bond. The 
muonium shuttling between 
the bromine atoms would 
form a lower-energy system 
than the vibrations of MuBr 
alone. 

These calculations suggest 
that the bond might have 
been produced in the earlier 
experiment, which combined 
muonium and bromine. 
Angew. Chem. Int. Ed. http://doi. 
org/f2vjn6 (2014) 
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One of the gang 


Poland signed an agreement 
on 28 October to join 

the European Southern 
Observatory (ESO). The 
country is set to become 

the ESO’s 14th European 
member state once its 
parliament ratifies the 
agreement. Membership of 
the organization will give 
Polish astronomers access 
to the ESO’s ground-based 
telescopes. It will also allow 
companies in Poland to bid 
for contracts to build the 
39-metre European Extremely 
Large Telescope, which is 
being constructed in Cerro 
Armazones in Chile. Brazil, 
the ESO’s only non-European 
member, has yet to ratify an 
accession agreement that it 
signed in 2010. 


Climate warning 


The Intergovernmental 

Panel on Climate Change 
(IPCC) has warned of “severe, 
pervasive and irreversible 
impacts for people and 
ecosystems” if greenhouse-gas 
emissions are not substantially 
reduced over the next few 
decades. The warning, 
addressed to policy-makers, 

is included in the summary 

of the IPCC’s fifth assessment 
report of climate risks, 
released on 2 November in 
Copenhagen. The summary 
distills the latest contributions 
by the IPCC’s three working 
groups to the fifth assessment, 
as well as two special reports. 


Vaccine approval 
US regulators on 29 October 
approved a vaccine against a 
deadly strain of a bacterium 
that causes meningitis. 
Trumenba, produced bya 
subsidiary of Pfizer of New 
York, fends off infection 

by Neisseria meningitides 
serogroup B — a class 

of bacterium known for 


Lava invades Hawaiian town 


deployed last week to help to erect a roadblock, 
and about 20 families were told to evacuate 
their homes. By 30 October, the leading edge 
had stalled 155 metres from the road, but on 

2 November, the US Geological Survey reported 
active lava breakouts from parts of the flow. 
Kilauea is the most active volcano on Hawaii; its 
current eruption began in 1983. 


A creeping lava flow from the volcano Kilauea 
on Hawaii is threatening a community of almost 
1,000 people. The flow (pictured), which began 
on 27 June, has travelled roughly 20 kilometres 
and reached the town of Pahoa, where it has 
overrun pastures, a cemetery and private 
property on a course that is heading for the 
towns main road. The US National Guard was 


causing outbreaks among 
university students. Last 
December, the Food and Drug 
Administration allowed the 
use of a different meningitis 
vaccine that is not approved 

in the United States — but 

is available in Europe — at 
universities where outbreaks 
had occurred. 


Polar talks fail 


The Commission for the 
Conservation of Antarctic 
Marine Living Resources 
concluded its annual meeting 
in Hobart, Australia, on 

31 October, without agreeing 
on a plan to create a massive 
marine reserve in the Ross 
Sea. The commission, made 
up of representatives from 

24 countries and the European 
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Union, has failed 3 times 
before to agree on similar 
plans to ban fishing in what 
some researchers say is the 
most endangered area of the 
polar region (see Nature http:// 
doi.org/wtt; 2014). 


Emissions fines 


The US Environmental 
Protection Agency announced 
on 3 November that car 
makers Hyundai and Kia will 
pay the largest settlement 

ever for alleged violations of 
the US Clean Air Act. The 
companies are said to have 
sold nearly 1.2 million vehicles 
that will collectively emit 
about 4.75 million tonnes of 
greenhouse gases above the 
amount that the companies 
certified to the agency. In 


addition to a US$100-million 
penalty, they will also forfeit 
4.75 million previously claimed 
greenhouse-gas emissions 
credits, estimated to be worth 
more than $200 million. 


Spaceflight crashes 


US government investigators 
are probing the 31 October 
crash of Virgin Galactic 
rocketplane SpaceShipTwo, 
which killed one of the craft’s 
two pilots. The plane was on 

a test flight for commercial 
space travel. Three days earlier, 
a crewless Antares rocket 
exploded seconds after being 
launched from Wallops Island, 
Virginia, destroying scientific 
equipment and experiments 


USGS/GETTY 
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headed for the International 
Space Station. Orbital Sciences 
in Dulles, Virginia, which 
operated the Antares, is one 

of only two private companies 
with a NASA contract to fly 
cargo to the space station. See 
page 15 and Nature http://doi. 
org/ws5 (2014) for more. 


Political research 
On 28 October, the presidents 
of Stanford University in 
California and Dartmouth 
College in Hanover, New 
Hampshire, issued a public 
apology for a controversial 
political-science experiment 
in Montana. The study 

aimed to test whether voters’ 
decisions would be affected 

by pre-election flyers that 
characterized the political 
attitudes of state Supreme 
Court candidates. Roughly 
100,000 people received the 
flyer, which appeared to be an 
official government document 
because it bore the state’s seal. 
Stanford said that the research 
had not been submitted for 
approval by its institutional 
review board; both universities 
are investigating other possible 
violations. 


Lunar loop 


China successfully completed 
its first robotic mission 

to the Moon and back on 

1 November. Launched 

on 23 October, the probe 


TREND WATCH 


A global analysis suggests that 


human infectious-disease 


outbreaks are becoming more 


frequent and more diverse 


(K. Smith et al. J. R. Soc. Interface 
11, 20140950; 2014). A team led 
by Katherine Smith of Brown 
University in Providence, Rhode 
Island, found that the trends were 
significant even after correcting 


for changes in surveillance 
and reporting (for example, 


using indirect measures such 
as Internet use). However, the 
number of cases per person is 


falling, they find. 


flew around the Moon and 
survived re-entry into Earth's 
atmosphere to land safely in 
Inner Mongolia (pictured). 
The vehicle, nicknamed 
Xiaofei, or little flyer, had 

no scientific goals, but was 
intended to test technology 
for Change-5, a mission 

to return lunar samples to 
Earth planned for 2017. The 
success makes China the first 
country to fly a probe around 
the Moon and back since the 
Soviet Union in the 1970s. 


Falsified data 

The US Office of Research 
Integrity on 29 October 
reported findings that a 
former laboratory director at 
the US National Institute of 
Arthritis and Musculoskeletal 
and Skin Diseases in Bethesda, 
Maryland, had committed 
research misconduct. An 
investigation concluded that 
Bijan Ahvazi, previously 

the head of the Laboratory 


of X-ray Crystallography, 

had falsified data related to 
three publications. As part 
ofa settlement, Ahvazi has 
agreed to have his research 
supervised and to be excluded 
from peer-review committees 
for agencies such as the US 
National Institutes of Health 
for two years. 


Pp FUNDING 
Funding boon 


Germany’s universities and 
large science organizations 
have been promised a boost 
to help them to cope with 
rising student enrolment 

and the increasing costs of 
research. At their meeting 

on 30 October, science 
ministers from the federal 
government and Germany’s 
16 state governments pledged 
€25.3 billion (US$31.6 billion) 
over the next six years to 
continue special programmes 
for science and higher 


DISEASE OUTBREAKS ON THE RISE 


Both the number and diversity of outbreaks of human 
infectious diseases have risen since 1980. 
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SEVEN DAYS | THIS WEEK | 


8-9 NOVEMBER 

On the 25th anniversary 
of the fall of the Berlin 
Wall, the city holds 

the sixth Falling Walls 
Conference. Scientists 
from around the world 
discuss impending 
breakthroughs in areas 
including cancer care, 
neuroengineering and 
global energy. 
go.nature.com/beyrwm 


12 NOVEMBER 

The European Space 
Agency’s Rosetta craft 
attempts to land a 
washing-machine-sized 
probe on the surface of 
comet 67P/C-G. 


12-19 NOVEMBER 
The World Parks 
Congress takes place 

in Sydney, Australia, 
featuring sessions on the 
use of mobile technology 
in conservation, 
successes and challenges 
in rhino conservation 
and the future of 
privately protected areas. 
go.nature.com/c5ighb 


education. See go.nature. 
com/6qsa5n and page 17 
for more. 


Money for malaria 
The Bill & Melinda Gates 
Foundation announced on 

2 November a US$156-million 
contribution to the PATH 
Malaria Vaccine Initiative. 
The donation will support 

two lines of development: 
vaccines that prevent people 
from becoming infected 

after being bitten by infected 
mosquitoes, and transmission- 
blocking vaccines that prevent 
mosquitoes from becoming 
infected when they bite people 
with malaria — with special 
attention to vaccines that 
combine both features. 


> NATURE.COM 
For daily news updates see: 
WwW.nature.com/news 
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Great Barrier Reef splits explain apparent plateau hunting creationist sues science 25 years since the 
scientists p.16 in Liberian cases p.18 university p.20 Berlin Wall fell p.22 


COMMERCIAL SPACEFLIGHT 


Fledgling space industry 
resolute after fatal crash 


Virgin Galactic accident should not be allowed to stifle innovation, warn analysts. 


BY ALEXANDRA WITZE 


hat was already a bad week for 
commercial spaceflight turned 
tragic on 31 October with the fatal 


crash of SpaceShip Two, the rocket plane owned 
by Virgin Galactic that was meant to ferry 
tourists and scientists to the edge of space. The 
fledgling private space industry is reeling from 
the accident, in which the co-pilot, Michael 
Alsbury, died, and the pilot, Peter Siebold, was 
left seriously injured. The crash came just three 
days after the explosion ofan uncrewed Antares 
rocket that was launching on acommercial mis- 
sion to resupply the International Space Station. 

But analysts warn against letting the accidents 


curb spaceflight innovation. “It’s absolutely 
critical that people don't throw up their hands 
and say ‘it’s just too hard,” says Joan Johnson- 
Freese, a space-policy specialist at the US Naval 
War College in Newport, Rhode Island. 

As Nature went to press, the cause of the 
accident was still unclear. A 2 November press 
conference held by the US National Transpor- 
tation Safety Board (NTSB) focused on the 
tail booms. These are intended to help slow 
the rocket plane's descent when it re-enters the 
atmosphere, in a process known as feathering. 
Pilots are supposed to engage the feathering 
action when the craft is moving at roughly 
1.4 times the speed of sound, at the top of its 
parabolic arc of flight. The idea, inspired by a 


badminton shuttlecock, is to increase drag and 
allow the spaceship to descend safely. 

Two steps are normally required for feath- 
ering, said Christopher Hart, acting chairman 
of the NTSB, which is holding its first acci- 
dent investigation involving the commercial 
space industry. On the fateful SpaceShip Two 
flight, video footage retrieved from the cockpit 
shows that one of the pilots completed the first 
step when the vehicle was moving at only the 
speed of sound. The second action was not per- 
formed, but seconds later, the tail booms began 
moving to their feathered position anyway. 

Until that point, SpaceShipTwo had 
performed as expected. It took off from the 
Mojave Air and Space Port in California, 
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> hanging beneath its carrier aeroplane. It 
then detached, and fired its rocket motor. 
Within minutes, it had disintegrated and its 
pieces were scattered over an 8-kilometre-long 
swathe of the Mojave Desert. 

Among other goals, the flight was meant to 
test a new type of rocket motor. Virgin Galac- 
tic decided in May to switch SpaceShipTwo’s 
fuel from a rubber to a plastic base, in part to 
boost the spacecraft’s final altitude. The fuel 
reacts with liquid nitrous oxide to propel the 
rocket. Hart said that the fuel and oxidizer 
tanks were found intact. In 2007, an explosion 
during a ground-based test with nitrous oxide 
killed three workers at Scaled Composites, the 
Mojave-based company that designed and built 
SpaceShip Two. 

SpaceShip Two is a larger, eight-seat version 
of SpaceShipOne, the two-seater that won the 
US$10-million Ansari X Prize in 2004 on 
becoming the first private vehicle to repeatedly 
reach an altitude of 100 kilometres. 

Virgin Galactic had been selling seats on 
future flights, with up to 5 minutes of weight- 
lessness, for $250,000 each. Its customers 
include not only celebrities but also scientists 


who want to use it for microgravity studies. A 
second SpaceShip Two is about 60% built. Mean- 
while, a competing company, XCOR Aerospace 
of Mojave, is building a piloted space plane that 
would take tourists up one at a time. 

The accident underscores the complexities 
of private spaceflight, where engineering sys- 
tems are designed from scratch and tested in 
very public view. Many compare the endeavour 
to the early days of aviation, when aeroplane 
companies crashed time and again as they tried 
to commercialize air transportation. “To some 
degree, we are still in the infancy of spaceflight,” 
says Scott Hubbard, director of Stanford Uni- 
versity’s centre of excellence for commercial 
space transportation in Palo Alto, California. 

Commercial spaceflight will probably 
survive, analysts say, but only if the public is 
as willing to accept the risks as are aerospace 
experts. “I dont mind if it takes some time to 
develop,” says Alan Stern, a planetary scientist 
at the Southwest Research Institute in Boulder, 
Colorado, who has signed up for a research 
seat on SpaceShipTwo. “I'm quite convinced 
that commercial suborbital flight will be safe.” 
NASA is building a new launch system for 


astronauts, but it is space companies that are 
coming up with rockets and spacecraft designs. 

As the space-tourism industry tries to assess 
its future, another area of commercial space- 
flight is likely to keep forging ahead. Last week's 
failed Antares mission was part of a series 
of cargo flights to the International Space 
Station organized by Orbital Sciences of Dulles, 
Virginia, in partnership with NASA. An 
inquiry led by Orbital aims to find out why 
the rocket lifted briefly off its launch pad at 
Wallops Island, Virginia, before exploding and 
falling to the ground. 

The next planned step for private companies 
will be flying US astronauts, as well as cargo, 
to the space station. In September, SpaceX 
of Hawthorne, California, and Boeing of 
Houston, Texas, won a contract to begin fly- 
ing astronauts by 2017. Hubbard, who chairs a 
safety committee to review SpaceX’s upcoming 
crewed flights, notes that Orbital is not in the 
running for those. 

“Accidents with new technology are inevi- 
table, says Johnson-Freese. “How they are 
handled is the true test of innovation and 
innovators.” = 


CONSERVATION 


Future of Great Barrier Reef 
divides scientists 


Marine-park management comes under scrutiny as conservationists descend on Australia. 


BY DANIEL CRESSEY 


he health of the world’s most famous 
[svat of ocean real estate — the Great 

Barrier Reef Marine Park — will take 
centre stage next week as conservationists from 
around the world head to Sydney, Australia, 
for a once-in-a-decade meeting on ecosystem 
management. The park faces challenges, but 
scientists disagree over how endangered it is 
and how well it is being managed. Climate 
change further complicates the picture. 

Every ten years, the International Union for 
Conservation of Nature hosts the World Parks 
Congress to decide how to use parks to promote 
conservation (see page 28). On 12-19 Novem- 
ber, a particular focus will be how to enhance 
and expand marine parks (see ‘Marine parks 
on trial’). Yet the Great Barrier Reef, which was 
once held up asa shining example of ecosystem 
management, has run into trouble. 

Lying off the eastern coast of Australia, the 
park covers an area of ocean approximately the 
size of Germany, encompasses 3,000 coral-reef 


systems and is the largest ‘living structure’ on 
Earth. It is managed by the Great Barrier Reef 
Marine Park Authority (GBRMPA), which 
has divided it into zones that impose different 
restrictions on activities, such as scuba diving 
or fishing. 

This year, protests escalated over a proposed 
port expansion that 


would have dumped “The overall 
dredge material outlookfor the 
within the park Great Barrier 
boundaries. The plan Reef is poor and 


was abandoned but 
the United Nations 
Educational, Scientific & Cultural Organization 
(UNESCO) will next year be deciding whether 
damage done to the park through degrada- 
tion and development means that it should be 
included on the List of World Heritage in Dan- 
ger. In August, the GBRMPA itself published a 
report warning that “the overall outlook for the 
Great Barrier Reef is poor and getting worse’. 
Among the evidence for problems is a much- 
cited 2012 study showing that coral cover had 


getting worse.” 
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halved between 1985 and 2012 (G. Death et al. 
Proc. Natl Acad. Sci. USA 109, 17995-17999; 
2012). The report placed much of the blame on 
cyclones and unusually large swarms of crown- 
of-thorns starfish (Acanthaster planci), which 
eat reef-building corals. 

Some believe that much of the damage is 
temporary. Aaron MacNeil, who is studing the 
Great Barrier Reef at the Australian Institute 
of Marine Science in Townsville, points out 
that two huge cyclones — Hamish in 2009 and 
Yasi in 2011 — hit the reef in a particular way, 
producing a combined battering expected just 
once every 600 years. “I think in general the 
Great Barrier Reef is in pretty good shape but it 
has had a rough few years of storm activity that 
have left coral cover unusually low,” he says. 

Others, however, say that things are worse 
than they seem. Marine palaeoecologist John 
Pandolfi at the University of Queensland in 
Brisbane has been using sediment cores and 
other methods to reconstruct the reef’s history 
over the past 1,200 years. “I am afraid that if 
you compare the current state of the reef to the 
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The Great Barrier Reef is struggling to cope with the impacts of climate change and development. 


kinds of long timescales that my team looks 
at, then things are even worse than what you 
might have heard,” he says. 

Pandolfi’s team has documented declines 
in Acropora corals, which are vital to the reef 
framework, that date back to the 1920s (G. Roff 
et al. Proc. R. Soc. B. 280, 20122100; 2013). The 
losses are probably linked to changes in agri- 
culture that were brought in by European set- 
tlers and affected water quality and damaged 
the reefs. Current reports therefore may under- 
estimate declines in reef quality because they 
are often based on comparisons to a degraded 
state of the reef, rather than its truly pristine 
state, an issue known as shifting baselines. The 
GBRMPA is using Pandolfi’s work to try and 
address this problem. 

Modern pressures include the effects of 
nearby land development, such as run-off ferti- 
lizer from farming. Russell Reichelt, the GBRM- 
PAs chairman and chief executive, says that the 
threat from the proposed dredge dumping was 
overblown, but that the GBRMPA will encour- 
age government and local businesses to adopt 
a policy whereby their activities have a positive 


net impact on the reef, not just a neutral one, 
as is the case now. It is also introducing targets 
for maintaining habitats and species as well as 
systems for assessing cumulative impacts. 
Marine scientist Bob Kearney at the Univer- 
sity of Canberra says that designating the reef 
as a marine park has fostered an “inappropri- 
ate” focus on fishing, given the bigger threat 
posed by climate change. The reef is highly 
sensitive to changing temperatures and ocean 
acidification, but global action is needed to 
tackle the carbon emissions that are the root 
cause of these issues. Similarly, last week, the 
Australian Academy of Science criticized a 
multimillion-dollar reef sustainability plan 
drawn up by the Australian and Queensland 
governments because it “fails to effectively 
address” any of the major pressures on the reef. 
MacNeil is more optimistic, thanks to the 
current collaborative approach between the 
government, universities and the private sec- 
tor to solving the reef’s problems. “By working 
together I think we're in a better position to 
understand and address threats to the Great 
Barrier Reef than ever before,” he says. = 


MARINE PARKS ON TRIAL 


Indonesian corals in controlled test 


The effect of marine park areas (MPAs) — 
parts of the ocean that are protected and 
managed for conservation — is difficult 

to tease out, not least because ocean 
ecosystems are affected by so many 
variables. Now, a study akin to a randomized 
controlled trial is aiming to do just that 

in the remote Bird’s Head Seascape in 
Indonesia. 

Sitting in the ‘coral triangle’ north of 
Australia, the seascape contains more 
than 2,000 islands, many of which have 
remained undeveloped. Gabby Ahmadia, 


a marine scientist at the conservation 
organization WWF, has been searching for 
areas that can be used as controls for seven 
newly created MPAs within the Bird’s Head. 

Distance from fishing markets and 
exposure to waves, for example, must be 
matched between a park and its control. 
Ahmadia’s team will then measure factors 
such as fish biomass to determine whether 
the park designation is making a difference. 
“We really need to get to the question of are 
these MPAs working, and, when they are 
working, why,” she says. D.C. 
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Basket of gifts 
for German 
science 


Despite looming recession, 
research gets a windfall. 


BY QUIRIN SCHIERMEIER 


ermany’s economy may be shaky, 
(5 its science is in rude health. On 

30 October, ministers from the fed- 
eral government and Germany’s 16 states 
pledged €25.3 billion (US$31.6 billion) 
over the next 6 years to support special pro- 
grammes for research and higher education. 

With a research-and-development 
expenditure of about €100 billion per year 
— some two-thirds of which is corporate 
money — Germany is the fourth-highest sci- 
ence spender worldwide, behind the United 
States, China and Japan. Public funding 
for special programmes remained strong 
even after the global financial crisis of 2008. 
Now the windfall looks set to continue for 
the country’s universities and large science 
organizations, despite drops in exports and 
industrial production that suggest that the 
economy could tip into recession. 

The bulk of the money is for universities. 
Currently, some 2.6 million students are 
enrolled at Germany’s roughly 300 univer- 
sities — around 400,000 more than in 2005. 
First-year enrolment is projected to grow by 
360,000 or so by 2020. To cope, the federal 
government plans to continue a pact with 
the states, launched in 2007, to jointly con- 
tribute nearly €20 billion by 2020. 

Ministers have also agreed to continue 
a deal that guarantees 4 non-university 
research organizations, including the Max 
Planck Society, annual budget increases of 
3% until 2020. Since 2005, the combined 
budget of the four organizations, which run 
a total of 254 institutes and large research 
centres, has risen from €5.2 billion to 
€7.9 billion. The rises are not earmarked 
for particular fields, but national priorities 
include energy, specifically renewables, and 
health, owing to the ageing population. 

The federal government also plans to 
boost the budget of its main grant-giving 
agency for university research by 5% next 
year to €2.95 billion, and by 3% annually 
from 2016 on. And ministers agreed in prin- 
ciple that the German Excellence Initiative, a 
€4.6-billion university competition for top- 
up funds launched in 2006, may continue 
beyond its initial expiration year of 2017. = 
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The reality of the Ebola outbreak is not reflected by model projections of high case numbers. 


EPIDEMIOLOGY 


Models overestimate 
Ebola cases 


Rate of infection in Liberia seems to plateau, raising 
questions over the usefulness of models in an outbreak. 


BY DECLAN BUTLER 


r Vhe Ebola outbreak in West Africa has 
infected at least 13,567 people and killed 
4,951, according to figures released on 
31 October by the World Health Organization 
(WHO). Now, in a rare encouraging sign, the 
number of new cases in Liberia seems to be 
flattening after months of exponential growth. 
Scientists say it is too soon to declare that the 
disease is in retreat: case data are often unreli- 
able, and Ebola can be quick to resurge. But it 
is clear that mathematical models have failed to 
accurately project the outbreak’s course. 
Researchers are now struggling to under- 
stand whether reports of empty beds at 
treatment centres and declining burial num- 
bers are signs that fewer people are developing 
Ebola, or whether cases and deaths are going 
unrecorded. In Liberia's capital, Monrovia, just 


80 of 250 beds were filled at the Médecins Sans 
Frontiéres (MSF) centre last week. But Fasil 
Tezera, who heads MSF’s Liberia mission, is 
cautious: “The present epidemic is unpredict- 
able,” he says. 

Epidemiologists normally use mathemati- 
cal models to estimate the trajectory of an 
outbreak, and to estimate where and how to 
direct scarce medical resources. But for the 
current crisis, on-the-ground data contra- 
dict the projections of published models, says 
Neil Ferguson, an epidemiologist at Imperial 
College London, and a member of the WHO's 
multidisciplinary Ebola Response Team. 

On 7 October, for example, modeller 
Alessandro Vespignani of Northeastern Uni- 
versity in Boston, Massachusetts, and his 
collaborators predicted that Liberia would 
see 6,900-34,400 cases by 24 October, and 
9,400-47,000 by 31 October. But the WHO 
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put the number of reported cases in the 
country at just 6,535 as of 25 October. 

Vespignani says that his model was a worst- 
case scenario, in which exponential growth of 
cases continued and containment measures 
were ineffective. But he and other modellers are 
also handicapped by incomplete and unreliable 
data on Ebola epidemiology, especially in the 
hardest-hit areas. And they have little empirical 
data on how disease-control measures quanti- 
tatively affect Ebola transmission, says ecologist 
Nick Golding, who studies the spatial distri- 
bution of disease at the University of Oxford, 
UK. Models “are fitted to pretty poor-quality 
data on case counts, and essentially no data on 
interventions’, he says, making it difficult to 
generate accurate projections. 

Two more-complex models published last 
month attempted to tease out the effects of 
various control measures. But their outcomes 
also do not square with the most recent Libe- 
ria data (J. A. Lewnard et al. Lancet Infect. Dis. 
http://doi.org/wn9; 2014, and A. Pandey et al. 
Science http://doi.org/wts; 2014). That does 
not surprise Alison Galvani, an epidemiologist 
at Yale University in New Haven, Connecti- 
cut, and an author of both studies. “Epidem- 
ics are moving targets,’ she says, adding that 
her model projections are at best a prelimi- 
nary outline for public-health intervention. 
Because the model projections can be easily 
misunderstood, Ferguson says that modellers 
“really need to think carefully about what we 
really know about Ebola transmission and the 
impact of different interventions, and do our 
best to communicate the many uncertainties”. 

In the meantime, Bruce Aylward, a WHO 
assistant director-general who is coordinating 
the agency’s Ebola efforts, is “terrified” that any 
plateau in new cases will be misinterpreted as 
meaning that the problem is going away. There 
is still a need to greatly increase the resources 
available to treat infected people and prevent 
new cases, Aylward says. 

But ifthe slowing rate of infection in Liberia is 
confirmed, it could suggest that even moderate 
levels of public-health intervention can pay off, 
says Golding. For the current Ebola outbreak, 
the average number of new cases spawned by an 
infected individual — 1.2-2.2 — ismuch lower 
than that of many other communicable diseases, 
such as measles (which can spread to between 
12 and 18 people per case). As Ebola preven- 
tion measures push down this figure, the disease 
becomes easier to control; when it dips below 1, 
virus spread stops completely. 

Until the West African outbreak is extin- 
guished, there is a real risk that the disease will 
resurge in areas where it has been stamped out 
— or even cover new ground. A stark reminder 
of this came in the past two weeks: a two-year- 
old girl with Ebola travelled hundreds of 
kilometres from Guinea to Mali on a bus — 
raising concerns that the many people she 
came into contact with could spark outbreaks 
in Mali. = 
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Lobbying sways 
NIH grants 


Pressure on lawmakers from patient-advocacy groups has 
shaped agency spending on rare-disease research. 


BY SARA REARDON 


dvocates for patients with rare diseases 
Az millions of dollars lobbying the 

US Congress each year — and it is 
money well spent, an economic analysis has 
found. Between 1998 and 2008, such efforts 
helped to increase new funding for rare- 
disease programmes by 3-15% each year at the 
US National Institutes of Health (NIH), accord- 
ing to a report to be published in Management 
Science (D. Hegde and B. N. Sampat Mgmt Sci. 
http://doi.org/fzs2vx; 2014). 

The effect of this growth on the NIH’s total 
budget (US$30 billion in fiscal year 2014) 
is small; targeted grants accounted for just 
10-20% of the agency's annual grant-making 
during the study period. But the analysis high- 
lights the fine line that the NIH must tread when 
choosing diseases to prioritize: maintaining the 
peer-review process by which it awards grants, 
but not ignoring the wishes of the lawmakers 
who control its budget. 

Congress curbed the power of lobbyists 
through a 2010 ban on setting aside money in 
bills for specific projects — known as earmarks. 
But lobby groups have shifted strategies. They 
seek to steer funds using ‘soft’ earmarks: lan- 
guage in spending bills that encourages or urges 
an agency to perform some action, such as 
funding Alzheimer’s research, rather than set- 
ting aside funds for it (see ‘Research rewards’). 

When economists Deepak Hegde of New 
York University and Bhaven Sampat of Colum- 
bia University in New York City examined the 
text of congressional reports on the NIH budget 
for mentions of 955 rare diseases, they found 
an average of 84 soft earmarks a year for these 
conditions. The true impact of lobbying on the 
NIH budget is likely to be even larger than their 
estimates, they say, because the study examines 
just a small slice of the agency's research portfo- 
lio that does not include diseases such as cancer. 

Rachel Best, a sociologist at the University 
of Michigan in Ann Arbor, says that lobbying 
provides a way for taxpayers to communicate 
their priorities to the NIH — and it pays. She 
analysed funding for 53 diseases over the course 
of 19 years and found that each $1,000 spent 
on lobbying for a condition correlated with a 
$25,000 increase in funding the following year 


RESEARCH REWARDS 


Lobbying by patient-advocacy groups is linked to 
a rise in ‘earmarks’ in bills funding rare-disease 
research at the US National Institutes of Health. 
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(R. Best Am. Sociol. Rev. 77, 780-803; 2012). 

This could be problematic if it starts to skew 
funding towards research that has little scientific 
merit or comparatively small public-health ben- 
efit, Best adds. She notes that the diseases that 
affect the most people do not necessarily get the 
strongest advocacy: she has found that illnesses 
that primarily affect women and minorities, and 
stigmatized conditions such as sexually trans- 
mitted diseases, tend to receive less lobbying 
support than other conditions. 

Jeremy Berg, a biochemist at the University of 
Pittsburgh in Pennsylvania and a former direc- 
tor of the National Institute of General Medical 
Sciences, says that he is neither surprised nor 
concerned by the findings. If Congress uses a 
soft earmark to encourage research on a par- 
ticular topic, he says, “it’s something NIH takes 
seriously” — but does not feel compelled to 
act on. “In my experience, it’s not regarded as a 
command, but input into what to do” 

And Pierre Azoulay, an economist at the 
Massachusetts Institute of Technology in 
Cambridge, says that it would be interesting 
to determine whether targeted grant pro- 
grammes for specific diseases chosen by the 
NIH yield better results than grants proposed 
by individual researchers. “To know whether to 
be exercised about this, we should look at how 
scientifically fruitful those different grants are,” 
he says. Hegde says that his group is currently 
doing just this for all NIH research funding. m 
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Mark Armitage was fired from his job as a lab technician at California State University, Northridge. 


SCIENCE AND RELIGION 


University sued 
by creationist 


Microscopist’s wrongful-dismissal case faces long odds. 


BY CHRISTOPHER KEMP 


discovery that he had dreamed of for years. 

While digging in Montana, he uncovered 
one of the largest triceratops horns ever found 
in the Hell Creek Formation, a legendary stack 
of fossil-bearing rocks that date to the last days 
of the dinosaurs. Armitage drove the horn 
back home to Los Angeles, California, where 
his microscopic examination revealed that it 
contained not only fossilized bone but also pre- 
served layers of soft tissue. “They were brown, 
stretchy sheets. I was shocked to see anything 
that was that pliable,” he says. 

In February 2013, he published his findings 
in Acta Histochemica, a journal of cell and tissue 
research (M. H. Armitage and K. L. Anderson 
Acta Histochem. 115, 603-608; 2013). Two 
weeks later, he was fired from his job at Califor- 
nia State University, Northridge (CSUN), where 
he managed the biology department's electron 
and confocal microscopy suite. 

Now he is embroiled in a long-shot legal fight 
to get his job back. In July, his lawyers filed a 
wrongful-termination suit claiming that reli- 
gious intolerance motivated the dismissal: as 
a young-Earth creationist, Armitage says that 
finding soft tissue in the fossil supports his belief 
that such specimens date to the time of the bibli- 
cal flood, which he puts at about 4,000 years ago. 

The suit alleges that faculty members hos- 
tile to Armitage had him fired because they 
could not stand working with a creationist who 
had been published in a legitimate scientific 


lE May 2012, Mark Armitage made a 


journal. He and his attorneys at the Pacific 
Justice Institute, a conservative legal organi- 
zation based in Sacramento, California, that 
focuses on religious and family issues, have 
repeatedly made that claim in the press. But 
specialists in US labour law suggest that his 
claim of religious intolerance might have dif- 
ficulty standing up if the case goes to trial. 

In recent years, a schoolteacher, academic 
and NASA employee who were creationists have 
claimed that they were fired unjustly for their 
religious beliefs. (None were reinstated.) But 
what makes this case different is that Armitage 
managed to survive for years in a mainstream 
academic institution and to publish research in 
a respected peer-reviewed journal. 

Armitage acknowledges that he did that by 
keeping his views on the age of the fossil out 
of the paper. Written with biologist Kevin Lee 
Anderson of Arkansas State University-Beebe, 
the study simply reported that the horn was 
found in Hell Creek (which has a well-accepted 
age of 65 million to 70 million years). “It was 
just morphology,’ says Mary Schweitzer, a 
palaeontologist at North Carolina State Univer- 
sity at Raleigh who reviewed the work before 
publication, and made the first discovery of soft 
tissue in dinosaur bones in 2005. “It was fine.” 

Creationists often appeal to soft-tissue pres- 
ervation as evidence that dinosaur fossils are 
thousands rather than millions of years old, says 
palaeontologist Jack Horner of the Museum of 
the Rockies in Bozeman, Montana. “Science is 
about building hypotheses and then attempting 
to falsify them, he says. “Creation science or any 
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kind of pseudoscience is just the opposite. It is 
coming up with an idea or a notion or anything 
else and finding evidence to support it” 

According to his lawsuit, Armitage never 
tried to conceal his beliefs from his employer. 
The filing says that when he was interviewed for 
his job at CSUN in November 2009, he did not 
hide that he holds degrees from the Christian- 
fundamentalist Liberty University in Lynch- 
burg, Virginia, and the Institute for Creation 
Research, previously in San Diego, California. 
In an interview with Nature, Armitage said 
that he was also equally open about his roughly 
30 technical articles on microscopy and his 2008 
self-published book Jesus is like my Scanning 
Electron Microscope. 

The lawsuit also claims that Armitage 
excelled in his job, receiving numerous letters 
of commendation. “I’m nota microscopist but 
as far as I could tell, Armitage was a good one,’ 
Paul Wilson, a biologist at CSUN, told Nature. 

CSUN declined to comment on Armit- 
age’s performance or its reasons for ending his 
employment. However, Jeffrey Noblitt, associ- 
ate vice-president of marketing and commu- 
nications at CSUN, did stress in an email that 
Armitage’s position had been “temporary”. 

Armitage freely admits that he often engaged 
students in conversations, giving his opinion on 
issues such as the age of the remarkably well- 
preserved cells in the triceratops horn. “To me, 
the obvious conclusion is they're young. They 
cant be 68 million years old,” he says. 

In terms of getting his job back, those conver- 
sations might be Armitage’s undoing. US anti- 
discrimination laws require employers to 
reasonably accommodate an employee's beliefs 
or religious practices, unless doing so would 
cause ‘undue hardship’ to the employer, says 
Justine Lisser, a spokesperson for the US Equal 
Employment Opportunity Commission. 

If Armitage made his living bending metal 
ina machine shop, an employer would find it 
difficult to show how his views caused undue 
hardship, she says. But in an academic setting, 
telling biology or palaeontology students that 
life began only a few thousand years ago more 
clearly undermines the institution’s goals. “It 
would be an easier showing of undue hard- 
ship,” says Lisser, “because it’s more related to 
the essence of what the person is doing.” m 


CORRECTION 

In the News story ‘Geneticists tap human 
knockouts’ (Nature 514, 548; 2014), the 
team investigating knockouts found some 
200,000 variations that knocked out genes 
(not 150,000 genes, as stated). In addition, 
the team that studied 36,000 Finnish people 
was not led by Daniel MacArthur, he was just 
a member of the group. Although Marfan 
syndrome can cause serious heart problems, 
it rarely results in sudden heart failure. 
Finally, Bing Yu is awoman nota man. 


DAVID WALTER BANKS 
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Inthe 25 yedrs since the 
collapse of communism, 
the countries of central 
and Eastern Europe have 
each carved their own 
identity in science. 


BY ALISON ABBOTT & } 
QUIRIN SCHIERMEIER 
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paint, Katarzyna Komorowska expertly 
handles what looks like a futuristic coffee 
machine. It is actually an advanced scanning 
electron microscope with the power to manip- 
ulate delicate samples and visualize minute 
details — one of several impressive-looking 
machines in Komorowska’s lab in the city of 
Wroclaw in southwest Poland. Komorowska 
turns on the device's ion beam. Minutes later, 
a screen shows the razor-sharp image of a 
bearded dwarf clutching a graphene molecule 
that she has just engraved ona grain of sand. 
The etched sand is a historical reminder as 
well as a technological feat. The dwarf became 
an unlikely symbol of the 1980s protest move- 
ment that grew in Wroclaw against Poland's 
ruling communist regime. It is now something 
of a city mascot: Wroctaw hosts more than 
300 dwarf statues, and visitors can track them 
down using a brochure and app. The fact that 
the dwarf can be engraved ona grain of sand 
in seconds also symbolizes the formidable 
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efforts that this city is making to become a 
science hub in central Europe. Since 2007, more 
than €200 million (US$250 million) in Euro- 
pean Union (EU) funds have helped to turn 
Wroclaw’s abandoned military hospital into a 
campus dedicated to academic and commercial 
science — just one part of Poland’s high-flying 
ambitions for science as a whole. 

Change has swept through central and East- 
ern Europe since the collapse of communism 
there 25 years ago. The revolution was quick 
and unforeseen. For a few months in 1989, 
protests swelled behind the Iron Curtain, the 
political barrier that since the end of the Sec- 
ond World War had isolated communist cen- 
tral and Eastern European countries from the 
West. Then, on 9 November that year, the East 
German government opened the Berlin Wall 
and first a trickle — then a flood — of East 
and West Germans began to scale the barrier, 
delirious with joy. A year later, Germany had 
been reunified and almost every other former 
communist country in the region had instituted 
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a democratic gov- 
ernment. 
Researchers 
shared in the ela- 
tion: the fall of the 
Iron Curtain brought them personal and intel- 
lectual freedom. But it came with a host of new 
problems. During the 45-year communist rule, 
research institutions from the Baltic to the Bal- 
kans had been academically isolated and unable 
to compete with the rest of the world. Now they 
were suddenly being judged by international 
standards, and their science looked hopelessly 
out of date. For many, political change also 
brought poverty, as economies collapsed. Piti- 
fully low salaries, lack of funding and antiquated 
labs prompted swathes of scientists to go west or 
seek careers outside academia. Those who 
stayed relied almost exclusively on foreign aid. 
“After the Iron Curtain had come down, science 
and higher-education institutes were thrown 
into turmoil? says Liviu Mattei, pro-rector of the 
Central European University in Budapest. “Few 


Scanning electron 
microscope image of a 
grain of sand engraved at 
EIT+ in Wroctaw, Poland. 
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places in the world have gone through such 
rapid and brutal changes.” 

Twenty-five years on, researchers find them- 
selves in a more stable scientific landscape. 
The economic decline of the 1990s has mostly 
ended, and in the past decade some countries 
have enjoyed a marked economic upswing 
that has allowed governments to inject money 
into science. Membership of the EU has been 
a major driver of change. In 2004, the union 
welcomed eight former communist coun- 
tries, including Poland, Estonia and Hungary. 
Romania and Bulgaria followed in 2007, and 
Croatia in 2014. One EU citizen in five now 
lives in one of these new member states. 

These relatively poor countries have enjoyed 
huge financial injections from EU structural 
funds, which are designed to narrow economic 
and social disparities between European regions 
and are distributed by each country’s govern- 
ment. In the 2007-13 financial period, Brussels 
invested a staggering €170 billion in cohesion 
and regional development in the new member 
states, and more than €20 billion of this was ear- 
marked for science and innovation. Most coun- 
tries have also created funding agencies that 
allocate grants on a strictly competitive basis. 
“Scientists had to learn that performance is now 
the sole basis of getting funded and published,” 
says Franci Deméar, director of the Slovenian 
Research Agency in Ljubljana. “It has been a 
difficult process, but it has greatly improved 
science produced in this part of the world” 

But within central and Eastern Europe, dif- 
ferent nations have followed starkly different 
trajectories in science, as a spotlight on three 
countries in the region reveals (see ‘Science in 
the new Europe’). Poland hosted relatively lit- 
tle research until recent years, but the nation is 
now becoming a political and economic power- 
house in the region and is rapidly expanding in 
science. Estonia, a small country on Europe's 
northern fringes, reformed its research system 
early on and is now reaping the benefits. Hun- 
gary, by contrast, maintained some scientific 
strengths during the communist era, but a lack 
of investment is now putting that legacy at risk. 

This means that when it comes to science, 
central and Eastern European countries — so 
similar to each other in their communist days 
— are growing steadily apart. What is more, 
almost all are still fighting a brain drain to the 
West. “The talent for science is all there; says 
Lars Walloe, a physiologist at the University of 
Oslo and former president of the Academy of 
Europe. “Now the conditions and institutions 
in the region need to develop in such a way that 
the best minds will find it worthwhile to stay.’ 


POLAND: POCKETS OF EXCELLENCE 

Poland has embraced science like few other 
countries in the region, as is evident on the 
Wroclaw campus with its dwarf-engraving 
machine. The campus has the air of a sprawl- 
ing, half-built start-up company. Extensive lab 
spaces are still under construction and will be 


opened to scientists and entrepreneurs next 
year. On a rainy September morning, labs and 
meeting rooms are buzzing with scientists test- 
ing equipment and discussing results. 

The campus is called EIT+, to echo the 
Budapest-based European Institute of Inno- 
vation and Technology, an EU effort to create a 
network of research powerhouses. Scientists at 
EIT+ pursue independent research in subjects 
including nanotechnology, materials science 
and biotechnology. But the campus operates as 
a limited company that aims to provide indus- 
try with research and services such as micro- 
scopy and crystallography, at a profit. “Twenty 
years ago the kind of things we're doing would 
have been unthinkable,” says Jerzy Langer, 
head of EIT+ and a former Polish deputy 
science minister. “The time has long gone that 
scientists here could say ‘Sorry, I can't do this 
or that, I haven't got the money and the tools” 

Komorowska joined EIT+ in 2012. She had 
trained in Wroclaw, but left the country for 
postdoctoral work in France and Belgium, 
with no plans to return. She changed her mind 


AFTER THE IRON 
CURTAIN HAD COME DOWN, 


SCIENCE WAS THROWN 
INTO TURMOIL. 


when she got a job offer from the newly created 
Wroclaw Research Centre, part of EIT+. She 
now leads the centre’s laboratory of nanotech- 
nology and semiconductor structures, where 
she is developing an automated system for ana- 
lysing the mineral and metal content of rocks 
— data of use to the massive Polish mining 
industry. Some 35 million zloty (US$10.5 mil- 
lion) are being spent on furnishing the lab with 
the latest electron microscopes for character- 
izing and observing materials. “The conditions 
to do science in Poland have improved enor- 
mously now that we have the same equipment 
as most people in the West,” says Komorowska. 
That was not true in the communist era, when 
the country operated just a few basic-research 
institutes. The situation began to change in 
1990, when Lech Walesa, leader of the trade 
union Solidarity, took over as Polish president 
and began to modernize the country. Poland 
went through a painful transition when democ- 
racy and a market-based economy arrived — 
and science was shaken to the core. Seeking 
more lucrative opportunities, thousands of 
researchers went into business or left to pursue 
academic careers abroad. What remained of the 
communist research base was jealously guarded 
by an increasingly inward -looking group of age- 
ing academics and produced little in terms of 
internationally competitive science. 


FEATURE | NEWS 


The situation really turned around 
when Poland joined the EU in 2004. With 
38.5 million inhabitants, the country is by far 
the most populous member state in the region, 
and also receives the most EU structural funds. 
That money has helped to fuel its remarkable 
economic growth, which has been outpacing 
that of most other European countries since 
2008. Science has ridden on the coat-tails of the 
country’s thriving economy, and the govern- 
ment has recognized that research is an impor- 
tant route to further growth. Domestic funding 
has doubled over the past five years, although 
overall science expenditure is still very low 
compared to other places in Europe, at less than 
1% of gross domestic product (GDP). 

Outside Wroctaw, the picture is not 
entirely rosy. Polish science still has a work- 
force shortage, with fewer than 4 researchers 
per 1,000 people in the labour force — well 
below the EU average of just under 7. Scien- 
tists acknowledge that many of the country’s 
research institutes — particularly the 80 run by 
the Polish Academy of Sciences — are reluctant 
to reform. What is more, says Langer, at some 
university departments a spirit of obedience 
lingers from the communist days and tends to 
stifle creativity. “Many students are too shy,” 
he says. “They think that ideas are no good if 
the authorities haven't rubber-stamped them.” 

To address the problem, Poland’s National 
Centre for Research and Development now 
runs a programme designed to bring back early- 
career Polish scientists who trained abroad, and 
to attract foreign scientists, by offering them 
up to 1.2 million zloty to start an independent 
group. (Securing this funding was another rea- 
son that Komorowska was persuaded to return.) 
And there is a wave of fresh talent on the way. 
The number of science graduates from Polish 
universities has more than doubled over the 
past decade, and overall student numbers have 
more than quadrupled since 1990. Every tenth 
student in the EU is now Polish. “This is most 
encouraging,’ says Langer. 

Scientists in Poland still want to see better 
standards of science education, and more inno- 
vative institutes such as EIT+. “There is suffi- 
cient money available now to support promising 
ideas, but we need to create institutional envi- 
rons in which science can flower,’ says Janusz 
Bujnicki, a molecular biologist at the Interna- 
tional Institute of Molecular and Cell Biology in 
Warsaw. “That task has only just begun” 


ESTONIA: SMALL AND FOCUSED 

Andres Metspalu, who runs one of the world’s 
most sought-after banks of human DNA, says 
that his life began at 40. That was in 1991, the 
year Estonia declared its independence from 
the Soviet Union and began its bumpy path 
towards a Western-style research base. 

Before this, Metspalu’s ambition to do world- 
class science had been constantly undermined 
by the absurdities of the isolationist Soviet sys- 
tem. That system had acknowledged his talent: 
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SCIENCE IN THE 
NEW EUROPE 


Since 2004, 11 former communist states 
have joined the European Union, gaining 
funding that has aided an economic 
upswing and increased investment in 
science. But they have followed different 
trajectories. Some are fully embracing 
modern science and are almost equal 
partners in Europe; others lag behind. 
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produce about 4% of the 
world’s publications, with 
Poland taking the lion’s share. 
This is still dwarfed by Western 
powerhouses such as Germany 
and the United Kingdom. 


it selected him in 1981 as one of only 25 young 
scientists across the entire Soviet Union to 
spend a year training in a US laboratory after 
his doctorate. (He studied biochemistry at 
Columbia University in New York, then at Yale 
University in New Haven, Connecticut.) But 
his young family had to stay in Tartu, Estonia, 
so that the authorities could be confident that 
he would not defect. And after that, he was for- 
bidden further travel until Mikhail Gorbachev 
swept to power in the Soviet Union in 1985. 
Over the next few years, Metspalu watched the 
Soviet Union fall apart. 

With a population of just 1.3 million people, 
Estonia found change easier than some larger 
countries. This, and some good political 
decision-making, helped to make the country 
one of the first in the region to reverse its eco- 
nomic decline. The government was prudent 
enough to stimulate business-orientated 
research and entrepreneurship early on. Newly 
created research centres and technology parks 
in Tartu and the capital, Tallinn, became the 
nuclei for a remarkable scientific upswing. 

After the human genome was sequenced 
at the turn of the millennium, Metspalu, 
who worked at the University of Tartu, saw 
a scientific opportunity. Thanks to support 
from Estonia’s EU structural funds, he was 
able to launch a project to recruit individu- 
als to donate their genetic and health data to 
a national biobank. In other countries, such 
as Iceland, similar efforts to collect personal 


information en masse were met with suspicion. 
But that was not the case among the newly 
liberated and optimistic people of Estonia, 
many of whom were happy to sign up. The 
biobank now includes genetic and health 
information on 5% of the country’s adult popu- 
lation and is a valuable international resource 
in studies that require very large numbers 
of people to identify risk genes associated 
with common diseases, including obesity! 


TWENTY-FIVE YEARS AGO 
I'D HARDLY HAVE IMAGINED 


CONTRIBUTING T0 A 
BIOMEDICAL REVOLUTION. 


and schizophrenia’. The database has been 
“extremely helpful”, says Michael O'Donovan, 
a psychiatrist at Cardiff University, UK, who 
was involved in the schizophrenia study. 

As well as biotechnology, the Estonian 
government is focusing scientific investment 
on disciplines such as materials science and 
informatics. The chemistry department at the 
University of Tartu, for example, is prominent 
in the area of superacids and superbases, use- 
ful in the development of batteries for electric 
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Poland has undergone 
sharp economic growth 
and is rapidly enlarging 
its research base. 
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cars, and has collaborations with car manu- 
facturers in several countries. Estonia has 
steadily increased its investment in research 
and development, from 0.72% of the GDP in 
2002 to 2.18% in 2012 — the second highest 
in central and Eastern Europe, after Slovenia. 


The country is keen to reap the rewards of 


its investments by using the biobank for more 
than just research. This year, the government 
formally pledged to support a project that 
would — in the next few years — link the 
repository with Estonia’s centralized health 
database to allow physicians to support their 
diagnoses and therapies with individual 
genetic information. If all goes to plan, this will 
put Estonia among the world’s front-runners in 
personalized medicine. “Twenty-five years ago 
Id hardly have imagined being able to contrib- 
ute to a biomedical revolution,” says Metspalu. 


HUNGARY: SCIENCE ON A SHOESTRING 
Last year, plant biologist Eva Kondorosi 
decided to pack up her life in Paris and take her 
European funding to the Hungarian Academy 
of Sciences (HAS) Biological Research Centre, 
where she had begun her research career in 
the late 1970s. But she found something odd: 
unlike many research institutes across central 
and Eastern Europe, this one seemed to be less 
exciting in the modern era than it had been in 
the communist one. “It has lost the vibrancy we 
enjoyed there in the 1970s and 80s,” she says. 
If that sounds paradoxical, it is because 
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EUROPEAN RESEARCH COUNCIL GRANTS 
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Bulgaria and Romania have been slow to 
reform research since the communist era. 


2006 2009 


science in Hungary is a bit of a paradox. During 
the communist years, research survived against 
the odds. It was separated from universities and 
centralized in institutes — run by the highly 
politicized HAS — that tended to appoint cro- 
nies and members of the ruling communist 
party to key research posts. At the end of the 
1960s, however, the HAS made a bold deci- 
sion to start afresh in biology by creating the 
Biological Research Centre in the southern 
city of Szeged, close to Hungary’s southern 
border and away from the stifling politics of 
the capital. Staff members at the multidiscipli- 
nary centre were appointed on merit, not politi- 
cal status. The centre became a safe haven for 
intellectuals, and Kondorosi did a doctorate 
in plant sciences there. The country’s borders 
were slightly more open than those of most of 
its communist counterparts, and throughout 
the 1980s researchers could gain permission to 
visit the West. This kept the science in Szeged 
cutting-edge, and the atmosphere buzzing. 

After the fall of communism, lack of invest- 
ment dampened enthusiasm, and many scien- 
tists left. Funding has never fully picked up. 
Hungary is one of only three countries to have 
reduced its public spending on research since 
2007 (the others are Croatia and Bulgaria), 
and it dedicated just 8.5% of its 2007-13 EU 
structural funds to research — compared to 
Estonia's 20% and Poland’s 14%. What is more, 
much of the structural funding has gone to 
companies, not academia. 


Despite this, the country’s rich academic 
legacy still attracts the best Hungarian scien- 
tists. Hungary has hosted more researchers 
with prestigious European Research Council 
grants — including Kondorosi — than any 
other former communist EU country. Kon- 
dorosi returned because the interdisciplinary 
organization of the Biological Research Centre 
offered her the opportunity to take the lessons 
she had learnt from studying plant-bacteria 
symbiosis and apply them to medicine. She 
and others have discovered antimicrobial 
activity in some of the bacteria that plants use 
for nitrogen-fixing’, for example. 

Hungary’s scientific culture has inspired 
international confidence. In 2012, CERN, the 
European laboratory for particle physics, built 
an advanced data centre close to Budapest. 
And Szeged is going to be home to one of the 
three nodes of the Extreme Light Infrastruc- 
ture, a collaborative EU project to advance 
laser science (see Nature 489, 351; 2012). 

Scientists are hopeful that the climate for 
science is warming up. In June this year, the 
charismatic Jézsef Palinkas was appointed to 
the new position of government commissioner 
for science and innovation. Palinkas was for- 
merly president of the HAS, where in 2011 he 
forced through reforms that streamlined the 
academy’s 40 research units into 15 larger cen- 
tres, and increased scientific competition for 
funding. In his new role he will be responsible 
for advising the government on science policy, 
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as well as coordinating the spending of the 
current round of research-related structural 
funds. In this round, he says, around 12% will 
be directed to research. 

Scientists in Hungary and elsewhere have 
their eyes on the EU’s Horizon 2020 pro- 
gramme, the €80 billion of research funding 
that started this year and will last until 2020. 
In a bid to widen participation, Brussels has 
launched a ‘teaming’ scheme that allows less 
potent member states to create or upgrade 
competitive research centres in partnership 
with leading institutions from other countries. 

Financial and organizational aid will 
remain crucial in narrowing the gaps between 
countries, says Walloe. “Some places will 
always have more capacity of science than 
others,” he says. “But every country should 
have at least one thing or the other that is 
really good. That is how science is organ- 
ized in the United States — and that’s what it 
should be like in Europe.” mSEEEDITORIALP.7 


Alison Abbott is Nature’ senior European 
correspondent and Quirin Schiermeier is 
Natures German correspondent. Both are 
based in Munich. 
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COMMENT 


Experts weigh in Loss of natural and The fame and 
on priorities for the world’s ecological capital leaves hole shame of the elements 
protected areas p.28 in national budgets p.32 that never were p.34 


( Carl Djerassi, 
father of the Pill, reflects on 
his life and literature p.36 


Conservation efforts risk getting snared in a tangle of aims. 


A call for inclusive 
conservation 


Heather Tallis, Jane Lubchenco and 238 co-signatories 
petition for an end to the infighting that is stalling 
progress in protecting the planet. 


nage-old conflict arounda seemingly 
A simple question has resurfaced: why 
do we conserve nature? Contention 
around this issue has come and gone many 
times, but in the past several years we believe 
that it has reappeared as an increasingly acri- 
monious debate between, in essence, those 
who argue that nature should be protected 
for its own sake (intrinsic value)” and those 
who argue that we must also save nature to 
help ourselves (instrumental value)*®. 
Champions of instrumental value contend, 
among other things, that protecting nature 
for its own sake alone has failed to stem the 
tide of species extinction, that conservation 


should be open to partnering with business 
to effect the greatest change and that conser- 
vation support will be broadened by more 
directly considering other social objectives 
(such as food security or clean water). By 
contrast, advocates of intrinsic value assert 
that ethical arguments for conservation 
should be sufficient, that partnering with 
business is selling out to those who create 
the problem and that social considerations 
are already central to conservation. 
Unfortunately, what began as a healthy 
debate has, in our opinion, descended into 
vitriolic, personal battles in universities, 
academic conferences, research stations, 


6 


© 2014 Macmillan Publishers Limited. All rights reserved 


conservation organizations and even the 
media’. We believe that this situation is 
stifling productive discourse, inhibiting 
funding and halting progress. 

Adding to the problem, in our view, is the 
issue that this dispute has become domi- 
nated by only a few voices, nearly all of them 
men’. We see this as illustrative of the bigger 
issues of gender and cultural bias that also 
continue to hinder conservation. 

The stakes? The future of conservation 
science, practice and policy. Conservation 
regularly encounters varied points of view 
and a range of values in the real world. To 
address and engage these views and values, 
we call for more-inclusive representation of 
scientists and practitioners in the charting 
of our field’s future, and for a more-inclusive 
approach to conservation. 


EMBRACE DIVERSE VALUES AND VOICES 
Women historically have been under- 
represented in environmental-science 
faculty positions and in conservation 
practice, as in most scientific fields. This 
disparity is changing globally, but at dif- 
ferent rates: more slowly in Asia and more 
quickly in Latin America and the Caribbean, 
for example’. In the United States, more than 
half the leadership positions in conservation 
organizations are now held by women. And 
on the global stage, women currently hold 
top positions in many leading efforts, includ- 
ing the Intergovernmental Platform on Bio- 
diversity and Ecosystem Services, the Future 
Earth science committee, and the Interna- 
tional Union for Conservation of Nature. 
This progress makes the dearth of female 
voices in the debate about the premise of our 
profession all the more stark. 

The signatories in agreement here — 
women and men from around the globe 
— support an equal role for women and 
for practitioners of diverse ethnicities and 
cultures in envisaging the future of conserva- 
tion science and practice. 

Together, we proposea unified and diverse 
conservation ethic; one that recognizes and 
accepts all values of nature, from intrinsic to 
instrumental, and welcomes all philosophies 
justifying nature protection and restoration, 
from ethical to economic, and from aesthetic 
to utilitarian. What we propose is not new. 
This diverse set of ethics has a long-standing 
history in modern conservation*. For 
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> example, more than 100 years ago, both 
intrinsic and instrumental values were used 
in the creation of Yellowstone National Park 
in Wyoming, and when Californians spurred 
the broader environmental movement in the 
United States by using economic studies of the 
value of birds alongside compelling speeches 
about the purity and grandeur of nature’. 

These values need not be in opposition, 
although they do reflect the hard choices that 
conservation often faces. They can instead be 
matched to contexts in which each one best 
aligns with the values of the many audiences 
that we need to engage. Those on the side of 
intrinsic value will argue that by recognizing 
the many ways in which people benefit from 
nature, we cheapen nature and miss oppor- 
tunities to save components of it that have 
little or no obvious value to people. This is a 
valid concern, and one of many reasons why 
we must continue to uphold intrinsic values 
to audiences who share those values, or may 
be inspired towards them. However, instru- 
mental values will remain more powerful for 
other audiences, and should be used in the 
many contexts where broadening support for 
conservation is essential’. 

Clearly, all values will not be equally served 
in every context. Approaching conservation 
problems with representative perspectives 
and a broad base of respect, trust, pragmatism 
and shared understanding will more quickly 
and effectively advance our shared vision 
of a thriving planet. Prominent institutions 
already embrace multiple voices and values. 
For example, the field’s signature international 


treaty, the Convention on Biological Diversity, 
calls for the conservation of biodiversity, and 
for the sustainable use and equitable sharing 
of its benefits. Some countries leading in this 
area, such as Mexico, Costa Rica and Colom- 
bia, have followed suit, capturing these joint 
interests in their own governing language. 


PRACTICAL ACTION 

What now? Academic training of conser- 
vation scientists should more accurately 
portray the rich, global history of the field, 
introducing students to the diverse ways in 
which nature has been valued and conserved 
for centuries. More forums at conferences, 
in journals and on social media are needed 
to elevate the voices of scientists and prac- 
titioners from under-represented gen- 
ders, cultures and contexts. Conservation 
organizations and scientists can embrace all 
plausible conservation actors, from corpora- 
tions to governmental agencies, faith-based 
organizations and interested individuals, 
and advance conservation efforts when 
they can benefit people and when there is 
no obvious human-centric goal. 

These efforts must be underpinned by a 
stronger focus on synthesizing and expand- 
ing the evidence base that can identify what 
works and what fails in conservation so that 
we can move from philosophical debates to 
rigorous assessments of the effectiveness 
of actions. And we must encourage the full 
breadth of conservation scientists and prac- 
titioners to engage with the media so that 
coverage reflects the true range of opinion 


(for example, the 240 co-signatories listed 
are ready for interview) rather than the polar- 
ized voices ofa few. To add your name to this 
petition, visit diverseconservation.org. 

It is time to re-focus the field of conserva- 
tion on advancing and sharing knowledge 
in all relevant disciplines and contexts, and 
testing hypotheses based on observations, 
experiments and models”. We call for an 
end to the fighting. We call for a conserva- 
tion ethic that is diverse in its acceptance of 
genders, cultures, ages and values. = 


Heather Tallis is lead scientist at the Nature 
Conservancy in Santa Cruz, California, 
USA. Jane Lubchenco is professor of 
marine biology and of zoology at Oregon 
State University in Corvallis, Oregon, USA. 
e-mail: htallis@tnc.org 
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For a full list of co-signatories and further reading 
on this topic, see go.nature.com/tezttv. 


A to-do list for the 
world’s parks 


Experts share their priorities for what must be done to make 
protected areas more effective at conserving global biodiversity. 


BOB PRESSEY 
Maximize returns 
on conservation 


Professor, Australian Research 
Council Centre of Excellence for Coral 
Reef Studies, James Cook University 


Protected areas are meant to preserve 
biodiversity, but practice, measures of pro- 
gress and targets do not reflect this role. 


Governments and non-governmental 
organizations usually concentrate on politi- 
cally palatable measures, such as numbers of 
hectares. Measures of progress and targets for 
protected areas should focus on placing pro- 
tection where it can make the most difference. 

A 2008 study estimated that only 7% of 
protected forests in Costa Rica would have 
been lost if not protected (K. S. Andam et al. 
Proc. Natl Acad. Sci. USA 105, 16089-16094; 
2008). These forests, like most protected 
areas worldwide, are in ‘residual areas — 
those where direct human threats to biodi- 
versity are low, and where ‘protection’ makes 
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little difference. Misleadingly, target 11 of the 
Convention on Biological Diversity meas- 
ures progress in percentages of land and sea 
protected. Meanwhile, the biodiversity of con- 
tested places continues to be eroded. 
Performance metrics for protected areas 
should borrow from those in medicine, 
education and development. These fields all 
aim to maximize returns on investment. The 
language of programme evaluators is framed 
in terms of efficacy: what is the actual out- 
come ofan intervention, compared with the 
outcome expected from no intervention? 
For protected areas, efficacy means 
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avoiding the loss of species and maintaining 
the integrity of ecosystems. There are meth- 
ods for estimating the losses that protection 
has prevented (to provide lessons) or could 
prevent (to set priorities). By these metrics, 
protected areas can be disappointing. 

Success depends on which natural 
resources societies are willing to leave unex- 
ploited. The trends are not encouraging. 
Australia, for example, hosts this year’s World 
Parks Congress, but most of its terrestrial and 
marine parks are residual, and the country’s 
protected-area strategy has no quantitative 
targets for avoiding loss. The congress could 
make a real difference ifit steers policies away 
from meaningless, counterproductive targets. 
Each year of delay means avoidable, irrevers- 
ible loss of biodiversity. 


DOUGLAS J. MCCAULEY 
Mega-parks need 
greater oversight 


Assistant professor in the department 
of ecology, evolution and marine 
biology, University of California, 
Santa Barbara 


In September, US President Barack Obama 
created the world’s largest marine protected 
area network by massively expanding the 
Pacific Remote Islands Marine National 
Monument (PRIMNM). Collectively, the 


Areas surrounding Victoria Falls, on the Zambia-Zimbabwe border, are protected by national-park status. 


PRIMNM is more than five times the size 
of the United Kingdom. Its creation ups the 
ante in a conservation phenomenon without 
precedent on land or sea. The eight marine 
mega-parks (each more than 250,000 square 
kilometres) announced in the past five years 
have almost doubled the amount of pro- 
tected area in the oceans. 

Three actions must be taken to ensure 
that mega-parks do more good than harm 
for the world’s seas. First, governments must 
recognize that conventional forms of moni- 
toring for protected areas are not tenable in 
parks that are larger than some countries. 
To ensure that areas such as the PRIMNM 
do not become ‘paper parks’ — marked as 
protected on maps but exploited in reality 
— governments must explicitly fund the 
development and use of next-generation 
enforcement, such as satellite and drone- 
based patrols. Such tools are not cheap, but 
mega-parks will not function unless they are 
designated in budgets as well as on maps. 

Second, policy-makers must enact regu- 
lations to manage highly mobile animals in 
the 96% of the ocean left unprotected. Many 
of the most at-risk species (including some 
turtle, shark and marine mammal species) 
are not fully protected, even in parks as big 
as the PRIMNM. 

Lastly, the marine mega-park movement 
does not let us off the hook for protecting 
crucial marine habitats at smaller scales. 
Bigger is better with marine protected 
areas, but these benefits might not scale lin- 
early. Although establishing 100 strategically 


placed, 10,000-square-kilometre marine 
parks is politically intractable, it would prob- 
ably have done more for marine biodiversity 
than the establishment of just the PRIMNM. 
If ineffectual practices can be avoided, 
environmental leaders will undoubtedly 
look back on this marine-mega-park era 
as one of the most important periods in the 
history of ocean conservation. Ifnot, mega- 
parks will be little more than mega-hype. 


LANCE MORGAN 
Protect diverse 
marine habitats 


President of the Marine Conservation 
Institute 


A portfolio of well-protected, representative 
marine ecosystems — humankind’s in situ 
seed vault for ocean life — is needed for bio- 
logical and human resilience. Only about 
2% of the ocean has any protection, and just 
0.83% is ‘no-take’ reserves, where humans 
are not allowed to extract fish, oil or other 
resources. Marine biologists recommend 
that 20-30% of the ocean must be protected 
to maintain its biodiversity. This amount 
will provide enough abundance to restore 
depleted populations outside reserves. 

To accelerate establishment of highly 
effective biodiversity refuges, the Marine 
Conservation Institute has initiated the 
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Global Ocean Refuge System (GLORES). 
The prestige and social capital that comes 
from receiving the GLORES status can spur 
governments, much as ‘green building’ 
certification has helped the adoption of sus- 
tainable practices in construction. 

Capturing a diversity of habitats is key. 
US national parks, for example, often 
encompass mountainous areas of the 
United States, but not prairies and wetlands. 
GLORES considers the effect of a protected 
area in the context of others. It accounts for 
marine biogeography and connectivity; for 
example, kelp forests occur in temperate 
biogeographic regions, whereas coral reefs 
occur in tropical regions. 

GLORES criteria require effective moni- 
toring and enforcement, whether by commu- 
nities, scientists or other authorities. The goal 
is to create protected areas in all of the differ- 
ent ocean regions and habitats (shallow and 
deep, sandy and rocky bottoms, and more). 

GLORES will be easier, cheaper and faster 
to implement than many other approaches. 
Protecting places is much less knowledge- 
intensive and less costly than managing 
marine species one by one or persuading 
countries to protect areas one by one (often 
small areas that fishers care least about). 


Represent 
ecosystems 


Professor of mathematics and 
ecology, University of Queensland; 
Chair of conservation decisions, 
Imperial College London 


The Convention on Biological Diversity asks 
countries to conserve at least 17% of their land 
and 10% of their seas. It also calls for “eco- 
logical representation’, the equitable coverage 
of species and habitats, but sets no quantita- 
tive targets. Representation is often ignored 
in designing systems of protected areas. For 
example, the koala is just one of many spe- 
cies that prefers under-protected fertile, well- 
watered habitats that are also favoured for 
agriculture and other development. 

There are better approaches. Represen- 
tation can be highly efficient. In 2004, the 
Great Barrier Reef Marine Park Authority 
used extensive economic and ecological 
data to create a system of ‘no-take’ pro- 
tected areas that conserved at least 20% of 
every habitat while covering only 33% of 
the region. 

To help refocus priorities, our group 
created software called Marxan. It uses 
mathematical optimization to prioritize 
places to design efficient and representative 
protected areas. We have also developed a 


An anti-poaching team of the conservation group WWF on patrol in Minkébé National Park, Gabon. 


new metric, protection equality, to measure 
equitable representation of habitats in a 
single number. It is a modification of the 
Gini coefficient commonly used to assess 
income inequality. 

For example, the United States has a rela- 
tively large fraction of its land conserved, 
but its land protection equality is poor, 
only 0.33. Australia, which has a policy for 
representation, has a smaller fraction of its 
land conserved but a higher land protection 
equality of 0.51, a much more representa- 
tive system. The protection equality of the 
Great Barrier Reef Marine Park is 0.80, more 
than any country. By contrast, the proposed 
re-zoning of Australias Commonwealth 
waters is biased towards deeper waters and 
misses entire ecosystems. 

We hope that this year’s World Parks Con- 
gress will stimulate more-sophisticated tools 
for building representative systems of pro- 
tected areas, and metrics for assessing them. 


Manage parks 
professionally 


Executive secretary, Gabon National 
Parks Agency 


The special parts of our planet warrant and 
need exceptional stewardship. They are not 
getting it. We need a pact — between politi- 
cal leaders, civil society and conservation 
professionals — to increase the political 
capital of the environment. 

Many of the world’s rarest and most 
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iconic species — gorillas, chimpanzees, 
orangutans, elephants, lions, tigers and 
pandas — survive almost exclusively in 
protected areas. In both developed and 
developing countries, protected areas often 
contain the richest, most pristine ecosys- 
tems. They also provide crucial ecosystem 
services. Mangrove parks succour fisheries 
and protect against floods (see page 32); 
forests provide clean, reliable water and 
help to regulate the climate. Tourism and 
leisure use of parks improve people's quality 
of life. If preserved, the biodiversity within 
parks could well yield as-yet-unknown 
medicines and other products. 

We need to strengthen and profession- 
alize park management. Too many of the 
developing world’s protected areas are 
chronically underfunded. And government 
neglect often means that management falls 
to international non-governmental organi- 
zations (NGOs). Because this responsibility 
is rarely formalized, NGOs do not havea 
strong mandate to protect these areas well. 

For example, in the late 1990s, the 
government of Gabon failed to take 
responsibility for Minkébé National Park, 
so the conservation group WWE stepped 
in. Despite spending millions of dollars, 
WWE was unable to stop the slaughter of 
elephants there: at least 16,600 elephants 
were lost between 2004 and 2012, mainly 
to cross-border poachers. In May 2011, the 
government deployed 120 military person- 
nel to support parks staff; in October 2014, 
with the situation still not under control, it 
pledged to double those numbers. 

The people who fight to preserve our nat- 
ural and cultural treasures must be trained 
and backed by their nations. Only then 
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Coral gardens in the Palmyra Atoll and Kingman Reef national wildlife refuges, protected under the Pacific Remote Islands Marine National Monument. 


will we be able to resist the ever-growing 
pressures that transnational crime, cor- 
ruption and increasing population place 
on wildlife and wild lands. 


Conserve climate 
refuges 


David H. Smith conservation research 
fellow, University of North Carolina 
and the Wildlife Conservation Society 


Climate change raises a triple threat that 
existing marine protected areas were not 
designed to defend against. Warming, rising 
and acidifying seas threaten global marine 
biodiversity and ecosystem services. Even 
in protected areas, El Nino events and ocean 
heatwaves can bleach and destroy vast areas 
of healthy coral reefs — the canary in the 
coal mine of climate change. 

To give coral reefs and other global eco- 
systems time to adapt, we need to identify 
areas that will escape the worst impacts of a 
changing climate. These should be protected 
as ‘climate refuges’ — areas that will experi- 
ence less change over the coming decades. 
In the northern Mozambique channel and 
the Raja Ampat archipelago in Indonesia, for 
instance, upwelling and ocean gyres bring 
cool water that has allowed fragile corals to 
escape bleaching. Emerging evidence sug- 
gests that several million years ago, rare reef 
habitats that escaped rising temperatures 
provided the blueprint for contemporary 
diversity. Today, climate refuges may be 


our best hope for protected areas to sustain 
healthy coral reefs into the future. 

The first steps are to catalyse local 
communities, national governments and 
multilateral agencies to protect such areas. 
Urgently, we need to coordinate, fund and 
implement a global plan to link networks 
of climate refuges for all ecosystems: coral 
reefs, tropical rainforests, Arctic tundra and 
beyond. The World Parks Congress must 
lay the groundwork to incorporate climate 
refuges into conservation portfolios and 
protected areas. 


Assess governance 
structures 


Researcher on natural resource 
governance approaches, University 
College London 


Projects such as the International Union 
for Conservation of Nature’s Green List 
are beginning to evaluate the effectiveness 
of protected areas systematically. This will 
help to shift the focus of conservation efforts 
from targets assessed just by hectares to 
other, more-meaningful objectives, focused 
on effectiveness. But to learn from successes 
and failures, we must also evaluate govern- 
ance systems. 

These systems incorporate roughly five 
approaches: top-down regulation, bottom- 
up participation, market mechanisms, 
awareness-raising and knowledge-sharing. 
We need to know what makes each effective, 


6 
© 2014 Macmillan Publishers Limited. All rights reserved 


and how these different approaches can be 
combined to reinforce each other. 

On Chumbe Island, a private island park 
off the coast of Zanzibar in East Africa, 
diverse approaches mesh to form a strong 
governance framework. In 1994, a non- 
profit company was granted property 
rights to the island and its surrounding 
waters, along with obligations to the local 
environment and community. For example, 
income from ecotourists is invested in local 
schools and other community projects. The 
local police assist in enforcing a no-fishing 
zone, and anti-poaching patrols provide 
community services, such as helping fish- 
ing boats in peril. 

Projects with fewer approaches are less 
robust. Consider the Cres-Lo8inj Special 
Marine Reserve in Croatia. Here local 
authorities instituted the Adriatic Sea’s 
largest marine protected area for dolphins, 
only for the designation to lapse when 
commercial developers touted the jobs 
and other economic benefits that a recrea- 
tional marina could provide. In this case, 
top-down regulation, along with other 
governance approaches, might yet prove 
effective. The European Commission could 
oblige the Croatian government to reinstate 
protection as a condition of joining the 
European Union. 

Too often, conservation discussions 
descend into unproductive debates about 
which governance approach is best, but the 
best solution varies with context. We need 
to learn the principles to match combina- 
tions of approaches with situations. The key 
to resilience is diversity — both of species in 
ecosystems and approaches in governance 
systems. m 
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This former mangrove forest once provided benefits such as storm protection and carbon sequestration. 


Account for 
depreciation of 
natural capital 


Economic indicators that omit the depletion and 
degradation of natural resources and ecosystems 
are misleading, warns Edward B. Barbier. 


makers have been discussing Thomas 

Piketty’s 2013 economics best-seller, 
Capital in the Twenty-First Century’. It docu- 
ments the considerable rise over the past 
40 years in national wealth relative to national 
income in eight of the richest economies — 
the United States, Japan, Germany, France, 
the United Kingdom, Italy, Canada and Aus- 
tralia. The national wealth of each of these 
countries increased from 2-3 times national 
income in 1970 to 4-6 times income in 2010. 


FE: the past year, academics and policy- 


Piketty relies on standard income conven- 
tions as prescribed in the United Nations 
national accounts. He includes natural 
resources such as fossil fuels, minerals and 
forests in his estimate of a country’s capital. 
But his measures of national income and 
savings adjust only for depreciation of ‘fixed 
capital? — buildings, equipment and so on. 

We must also account for the depreciation 
of natural capital in appraising wealth. This 
is the value of net losses to natural resources, 
such as minerals, fossil fuels, forests and 
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similar sources of material and energy inputs 
into our economy. If we use up more natural 
capital to produce economic output today, 
then we have less for production tomorrow. 

At the same time, we are also squander- 
ing valuable ecological capital — ecosystems 
provide important goods and services to the 
economy, such as recreation, flood protec- 
tion, nutrient uptake, erosion control, water 
purification and carbon sequestration. By 
converting and degrading ecosystems, we 
are depreciating this important ecological 
capital endowment. 

Economic indicators change dramatically 
when the depletion and degradation of natu- 
ral resources and ecosystems are accounted 
for. Here, I show by how much, through a 
worked example of mangroves in Thailand. 
Depreciation of natural capital is particu- 
larly high in developing economies, which 
are often rich in resources and ecosystems. 
We must retool our measures of income 
and wealth accordingly, starting with net 
domestic product. 


CREATIVE ACCOUNTING 

Since 1970, the World Bank’s World 
Development Indicators have provided 
estimates for most countries of the adjust- 
ments to national income, income growth 
and savings that arise from net depletion 
of forests, energy resources and minerals. 
This rate of natural-capital depreciation as a 
percentage of adjusted net national income 
over the past four decades is alarming (see 
‘Natural capital’). 

Two global trends are noticeable. First, the 
decline in natural capital has been five times 
greater on average in developing economies 
than in the eight richest countries. Second, 
natural capital depreciation in all coun- 
tries has risen significantly since the 1990s. 
There was a dip during the global recession 
of 2008-09, but as the world economy has 
recovered, so has the rate of resource use. 

Ecological capital, too, is clearly endan- 
gered by current patterns of economic 
development. Over the past 50 years, ecosys- 
tems have been modified more rapidly and 
extensively than in any comparable period in 
human history, largely to meet burgeoning 
demands for food, fresh water, timber, fibre 
and fuel. According to the worldwide Millen- 
nium Ecosystem Assessment, approximately 
60% of major global ecosystem services have 
been degraded or used unsustainably, includ- 
ing fresh water, wild fisheries, air and water 
purification, and the regulation of regional 
and local climate, natural hazards and pests. 

Unfortunately, ecological capital, being 
unique, poorly understood and difficult to 
measure, tends to be undervalued. Consider 
the example of mangroves in Thailand from 
1970 to 2009*. Average annual mangrove 
loss in Thailand has fallen steadily in every 
decade since the 1970s. Yet cumulatively, 
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SOURCES: TOP, WORLD DEVELOPMENT INDICATORS; BOTTOM, REF. 2 


Thailand is estimated to have lost around 
one-third of its mangroves since the 1970s, 
mainly to the expansion of shrimp farming 
and other coastal development. 


MANGROVE ECONOMICS 

Mangroves provide four essential ecosystem 
benefits: wood and products such as shell- 
fish, plants, honey and medicines; nursery 
and breeding grounds for offshore fisheries; 
storm protection; and carbon sequestration. 

I use estimates of these benefits to deter- 
mine the annual net gain or loss in mangrove 
value resulting from conversion to other 
land uses. This net value has two compo- 
nents. The remaining mangroves generate 
extra benefits each year that do not appear 
in the national accounts, such as net sub- 
sistence for local coastal communities and 
economy-wide carbon-sequestration ben- 
efits. From these values, I subtract the net 
loss in land value that arises each year from 
converting mangroves to some other eco- 
nomic activity, such as shrimp farming. 

The economic impacts are significant. 
During the 1970s and 1980s, when man- 
grove deforestation was rapid, Thailand lost 
US$1.69 and $0.76, respectively in mangrove 
net values per person per year. By 2009, 
around one-third of the 1970 mangrove 
area was deforested and Thailand’s popula- 
tion had grown rapidly. As a result, the total 
value from the subsistence and carbon ben- 
efits of the remaining mangroves has halved, 
from $0.57 to $0.28 per person per year (see 
‘Cutting costs’). This means that even though 
mangrove loss slowed in the 1990s and 2000s, 
the net values of mangroves were very mod- 
est, only $0.11 and $0.25 respectively. 

To put it another way, cumulative man- 
grove deforestation over the past four 
decades in Thailand has cost each Thai 
citizen $40. This debit amounts to losses of 
more than $2.73 billion, which have never 
appeared in Thailand’s national accounts. 


COUNTING THE COST 

Many more examples are now needed — 
for different countries and regions, and 
for other key ecosystems, such as tropical 
forests, coral reefs, freshwater wetlands, 
grasslands and so on. 

There are three caveats. First, there 
are clearly intrinsic values to preserving 
unique natural resources, species and eco- 
systems, as well as the biological diversity 
contained in these systems, which are not 
captured by such an approach. Second, 
the benefits of many important ecosystem 
services are difficult to value, such as pol- 
lution control, pollination, climate regu- 
lation and watershed protection. Third, 
measures of natural-resource depletion 
need to move beyond minerals, energy 
and timber harvests to include other vital 
resources, such as soils, air quality, aquifers, 


NATURAL CAPITAL 


COMMENT 


The decline in natural capital has been five times greater on average in developing 


economies than in the eight richest countries. 
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CUTTING COSTS 


Losing one-third of its mangroves to deforestation since 1970 has cost Thailand more 
than US$2.73 billion — a sum that has never appeared in national accounts. 


| | Total value of ~~) Net change in 
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fisheries and non-timber forest resources. 

The UN and the World Bank have begun 
pilot studies to construct adjustments to 
income and wealth that include changes in 
ecological capital. The UN Inclusive Wealth 
Report 2012 has developed’ accounts from 
1990 to 2008 for 20 countries that include 
non-timber benefits from forests, carbon 
sequestration, fisheries (for four countries 
only), carbon damages and agricultural 
land, as well as minerals, energy and tim- 
ber. The World Bank is expanding pilot 
studies on ecosystem accounting from 
8 to 15 developing countries, which cover 
water, forest and mangrove ecosystems (see 
www.wavespartnership.org). 

For estuarine and coastal ecosystems, 
there are already 80 valuation estimates from 
all over the world for storm protection, ero- 
sion control, water purification and supply, 
carbon sequestration, recreation and main- 
tenance of fishing, hunting and foraging 
activities — and the list is growing’. 

What will it take to move beyond these 
encouraging pilot studies? The UN systems 
of national accounts must adopt a more 
systematic approach that all countries can 
follow to account for losses of natural capi- 
tal and ecological capital, as we already do 
for fixed capital depreciation. And, in the 
case of complex ecosystems and landscapes, 
we need to resolve problems of ‘double 
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counting’ ecosystem services that might 
serve as ‘inputs’ into production or that are 
provided by multiple ecosystems, such as the 
protection of shorelines simultaneously by 
coral reefs, seagrass beds and mangroves. 

Piketty might be right that, since 1970, 
there has been substantial accumulation 
of capital relative to income in the rich 
countries of the world. As low- and middle- 
income countries try to emulate this suc- 
cess, they will also be striving to accumulate 
more wealth. But as my estimates show, our 
economies have been trading one form of 
capital, Earth’s riches, for another — human 
riches. Without accounting accurately for 
this trade-off, we will continue to have a 
false impression of economic progress and 
growth. That is as dangerous as flying an 
aeroplane into the night without navigation 
tools or instruments. = 
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Surf's up at SLAC 


A ‘plasma afterburner’ just 30 centimetres long accelerates electrons hundreds of times faster than giant conventional 
accelerators. The result may ultimately open up a low-cost technology for particle colliders. SEE LETTER P.92 


MIKE DOWNER & RAFAL ZGADZAJ 


reported that 120 surfers in Australia rode 

the same wave simultaneously for more 
than 5 seconds'. “The trick was to get them 
all to do the same thing at the same time,’ said 
group leader Wes Smith. “It was an operation 
of military-like precision and we finally got 
there.” Now Litos and colleagues, in work at 
the SLAC National Accelerator Laboratory in 
Menlo Park, California, reported on page 92 
of this issue’, have ‘got there’ too, by surfing 
half a billion 20-billion-electronvolt electrons 
on a steep charge-density wave about the size 
ofa marine phytoplankton, travelling through 
ionized gas (plasma). The wave was driven bya 
companion electron bunch as it raced at nearly 
the speed of light through a 30-centimetre- 
long chamber filled with plasma (Fig. 1). 

Although this inaugural experiment lost 
about 90% of its ‘surfers’ along the way, the sur- 
viving electrons gained 1.6 billion electronvolts, 
or 1.6 gigaelectonvolts (GeV), in energy with 
unparalleled uniformity, maintaining roughly 
1% energy spread throughout their wild ride, 
while sucking away an unprecedented fraction 
(up to 30%) of the wave’s energy. Such uniform, 
efficient acceleration required the researchers 
to inject the surfing electron bunch into the 
wave, and to adjust the bunch’s charge and 
shape, with a military-like precision made 
possible by SLAC’s recently commissioned 
US$15-million Facility for Accelerator Science 
and Experimental Tests (FACET)’. Because the 
plasma wave accelerated electrons 500 times 
faster than SLAC’s main particle accelerator, 
the result might herald a new generation of 
compact ‘plasma afterburners’ that could boost 
the energy of conventional particle accelerators 
and potentially reduce the skyrocketing cost of 
high-energy physics machinery*. 

Seven years ago, before FACET was even 
proposed, the same team had used single 
bunches of about 10 billion 42-GeV electrons 
and accelerated them over the full 3.2-kilo- 
metre length of SLAC’s main machine to drive 
a similar plasma wave’. A handful of electrons 
from the tail of the drive bunch were caught in 
the drive bunch’s wake and were accelerated up 
to 84 GeV, twice the energy of the electrons in 
the original drive bunch, within a metre-long 


I n November 2012, Guinness World Records 


Figure 1 | Ramping up the energy. SLAC’s main 
accelerator, shown in aerial view, accelerates 
electron bunches from 0 to 20 GeV energy over 

2 km, which amounts to adding 0.01 GeV to 

each electron every metre. The new Facility for 
Accelerator Science and Experimental Tests 
(FACET) used by Litos et al.’ then splits each 
20-GeV bunch into two independently controlled 
tandem bunches. The leading bunch creates a new 
micro-accelerator inside a 30-cm chamber (top), 
in which it drives a charge-density wave in ionized 
gas, much as a boat drives a wake in water. The 
trailing bunch rides the lead bunch’s wake and, 
when optimally positioned, extracts up to 30% 

of its energy, boosting each electrons energy by 
1.6 GeV in only 30 cm. 
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plasma chamber. However, the electrons 
emerging from this first-generation plasma 
afterburner ranged in energy from less than 
about 35 GeV to 84 GeV, more electrons were 
decelerated than accelerated, and most of the 
energy of the plasma wave was left untapped. 
FACET — which now shares SLAC with the 
Linac Coherent Light Source, and thus starts 
with 20-GeV electrons accelerated over part 
of SLAC’s length — was designed to correct 
these shortcomings. The facility exploits new 
particle-beam technology to split the SLAC 
bunches into two tandem bunches whose time 
separation, charge and shape are, with some 
limits, independently controllable. 

In the new experiments, the researchers 
used a little over half of the 20-GeV SLAC 
bunch to drive a plasma wave, and then timed 
its nearly equally charged twin to surf just a 
hair’s breadth behind, where its core rode 
the enormous electrostatic field of the drive 
bunch’s wake. Without the trailing surfing 
bunch, this field would be far from uniform, 
varying from 3 billion to 10 billion volts per 
metre (fields stronger than ordinary, non- 
plasma matter can withstand) just over the 
tiny region in which the surfing bunch was so 
painstakingly positioned. 

Had the researchers injected a lower- 
charged surfing bunch, it would have suffered 
the same fate as in the earlier experiment by 
broadening in energy. This would render it 
useless for high-energy physics applications, 
which require particle energy to be tuned 
precisely to create and identify new particles, 
such as the Higgs boson. However, Litos et al. 
took advantage of physics learned from com- 
puter simulations’ showing that a high-charge 
surfing bunch could ‘load’ the plasma wake, 
flattening its electrostatic fields locally. It is 
as if the 120 Australian surfers had sufficient 
collective weight to flatten the curved ocean 
wave into an inclined plane so that they could 
all accelerate at the same rate. This trick solved 
two problems simultaneously: it enabled 
a high-charge bunch to accelerate nearly 
monoenergetically while maximizing energy 
extraction from the plasma wake. 

Can plasma surfing meet future needs of 
high-energy physics research, which include 
electron bunches with sufficiently high energy, 
charge, repetition rate and focusability that they 
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can create detectable amounts of new particles 
that may be lurking in the cosmic underworld? 
The jury is still out. The present 1.6-GeV energy 
gain (starting from 20 GeV) is no greater than 
that achieved by plasma accelerators driven by 
light pulses from lasers (starting from zero)’, 
which are much smaller and less-expensive 
instruments than SLAC. Nevertheless, electron- 
driven plasma accelerators scale more readily to 
gains of tens of gigaelectronvolts than do their 
laser-driven counterparts, as demonstrated in 
previous work", 

Improved bunch-shaping technology will 
better match surfing bunch to plasma wave, 
increasing electron-survival rate and thus the 
number of accelerated electrons. Yet the Higgs 
boson has a mass equivalent to 126 GeV, and 
physical theories such as supersymmetry pre- 
dict additional particles that have even greater 
mass than the Higgs and may be the source of 
the elusive ‘dark matter’ that seems to comprise 
about 25% of the Universe. Creating and identi- 
fying these new denizens of the Universe could 
set the next energy frontier at many thousands 
of gigaelectronvolts. Reaching these energies 
will probably require synchronized, multi- 
staged plasma accelerators — a daunting, and 
largely unexplored, technical challenge in view 
of the micrometre dimensions of plasma waves. 

An interesting alternative proposal is to 
drive plasma waves with very energetic pro- 
ton bunches, which because of their greater 
mass can push plasma waves for hundreds of 
metres, potentially accelerating electrons to 
the energy frontier in a single stage’. In either 
case, plasma acceleration of positrons (anti- 
electrons) lags far behind electron acceleration 
because plasma waves shaped like those in the 
current experiment defocus surfing positron 
bunches, degrading their usefulness. Positron 
acceleration is important because high-energy 
collisions of electrons and positrons, a natural 
matter—antimatter pair, create a richer collec- 
tion of products with higher efficiency than, 
say, electron-electron collisions, and thus offer 
one of the most promising routes to particle 
discovery. FACET, with access to SLAC’s com- 
panion positron beam, is uniquely positioned 
to explore new ways to shape plasma waves 
in order to advance plasma-based positron 
acceleration. 

Finally, even if the energies and charges 
required for an electron—positron collider are 
achieved, debate rages over whether focused, 
plasma-surfed particle beams can yield parti- 
cle-discovery events at rates competitive with 
those achieved with conventional accelera- 
tor technology””"’, which underlies proposed 
tens-of-kilometres-long machines such as the 
International Linear Collider and the Com- 
pact Linear Collider. These uncertainties not- 
withstanding, Litos et al. have overcome one 
of the most difficult challenges so far in the 
long quest for small, affordable accelerators, 
and have given the plasma-surfing community 
every reason to surge ahead. = 
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Cells unite by 
trapping a signal 


Gradients of fibroblast growth factors often induce cells to adopt different fates. A 
study in zebrafish embryos reveals another, unexpected role when the factors are 
trapped in small spaces by a special arrangement of cells. SEE LETTER P.120 


JAMES SHARPE 


uilding complex, multicellular organs 
B during embryonic development is not 
just about making different cell types, it 

is about getting the right cells in the right place. 
For a cell to have some sense of where it is, it 
must integrate diffusible signals released by its 
neighbours. On page 120 of this issue, Durdu 
et al. provide evidence for a surprising new 
way by which diffusible signals such as fibro- 
blast growth factors (FGFs) are controlled — 
by trapping them in small, closed extracellular 
spaces called microlumina, from which they 
have access to only a discrete collection of cells. 
Exactly how signalling molecules provide 


Threshold 


FGF signalling levels 


WANS CK 


enough spatial information to build complex 
organisms is still obscure, but studies of the 
principles of signalling generally split into 
two types. Those of the upstream part of sig- 
nalling ask how the movement of diffusible 
molecules — sometimes called morphogens 
— is controlled to form appropriate spatial 
gradients~’, for example by ‘sticky’ molecules 
in the extracellular matrix*. Studies of the 
downstream part ask how these spatial distri- 
butions are used by receiving cells to control 
cellular ‘decisions. A well-characterized exam- 
ple of the upstream part is the FGF family of 
secreted proteins’. These often form coher- 
ent spatial gradients within which different 
levels of signalling divide the responding cell 
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Figure 1 | Roles of fibroblast growth factors (FGFs). a, A common role for FGFs is to induce different 
cell fates through spatial variations in FGF levels determined by the distance from FGF-expressing cells. 
In this schematic, cells that experience FGF levels above a threshold value are induced, whereas the others 
are unaffected. b, During the development of zebrafish embryos, a group of cells called the primordium, 
shown here from above, migrates from near the head-end to the tail. As it migrates, cells cluster into 
rosette structures that drop off at regular intervals and then develop into mechanosensory organs. Durdu 
et al.' report that confinement of FGFs to the microlumen at the centre of a rosette coordinates the cells 
within that cluster, so that the rosette drops off in a well-organized manner. c, The microlumina are 
enclosed by a patchwork of membrane sections contributed by all the surrounding cells, as shown in this 
side view ofa rosette. The right-hand graphic shows how one cell contributes to the microlumen, which is 


not shown to scale. 
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population into sub-groups with different 
fates (Fig. 1a). 

Durdu and colleagues took a fresh look at 
FGFs in their study of the development of 
the zebrafish lateral line — a sensory organ 
that lies along either side of all fishes, allow- 
ing them to sense vibrations in the water. In 
this developmental process, about 100 cells 
(called the lateral-line primordium) start 
near the head-end of the embryo and, over a 
two-day period, collectively migrate along the 
entire length of the developing body under 
the skin towards the tail®. During this journey, 
subgroups of cells cluster together within the 
primordium. These are called rosettes, because 
the cells adopt a radial arrangement in which 
each cell has an extension towards an apparent 
central common connection point (Fig. 1b). 
As the primordium migrates along the body, 
it drops off these rosettes one by one at regular 
intervals. Each rosette goes on to develop into 
a discrete mechanosensory organ. 

The authors knew that manipulating FGFs 
can affect the spacing of dropped organs’, 
but not whether this was through a general 
effect on primordium velocity. They therefore 
quantified time-lapse movies of developing 
zebrafish embryos in which Fgf3 levels had 
either been upregulated by overexpression or 
repressed by drug inhibition. In both cases, 
they saw that the migratory velocity of the 
primordium was unaltered, which means 
that Fgf3 was affecting the drop-off frequency 
instead. 

Having established a clear link between FGF 
signalling and rosette drop-off, Durdu et al. 
next explored where the signalling occurs. 
Fluorescence imaging of Fgf3 attached to 
green fluorescent protein suggested that it was 
localized into small, concentrated volumes at 
the apical centre of each rosette. Correlative 
microscopy (which combines fluorescence 
microscopy with electron microscopy) then 
revealed a striking cell-membrane arrange- 
ment: at the apical centre of each rosette was a 
microlumen formed by the cell membranes of 
all the cells of that rosette (Fig. 1c). 

The researchers again used time-lapse imag- 
ing to show that the moment when F¢f3 starts 
to accumulate in a microlumen correlates 
with the time when that rosette begins to slow 
down in preparation for dropping out of the 
primordium. This pointed towards the intrigu- 
ing possibility that FGF signalling is used ona 
very local basis to control the behaviour of just 
the 20 or so cells of one rosette. Durdu and co- 
workers went on to use all the advantages of the 
zebrafish system — ease of genetic modifica- 
tion and micromanipulation, and its suitability 
for high-quality time-lapse imaging — to test 
the idea. 

They modified a single rosette so that one of 
its cells had increased Fgf3 levels (using either 
single-cell transplantation or a stochastic 
inducible genetic system), and observed that 
just this rosette was forced to drop out early 


from the primordium. On average, neither the 
rosettes before nor after it were prematurely 
dropped. To perform the opposite experiment, 
they punctured microlumina with a laser, 
thereby letting Fgf3 leak out. Satisfyingly, they 
observed the expected delay in rosette drop- 
off, again without affecting the previous or 
subsequent rosettes. 

Several questions are not addressed in the 
study: for example, how the microlumina form 
in the first place; how levels of FGF expression 
are controlled; and, perhaps most directly 
relevant to the authors’ findings, how FGF 
signalling accelerates rosette drop-off. But the 
strength of Durdu and colleagues’ experiments 
is that single rosettes were manipulated in vivo, 
thus providing evidence that the microlumen 
can indeed restrict FGF signalling to the cells 
of just one rosette. 

In this system, FGFs do not adopt one of 
their conventional upstream roles, in which a 
coherent swathe of different signalling levels 
splits a responding population of cells. Instead, 
the microlumen forces FGFs to take on a more 
downstream role: coordinating the response to 
a morphogenetic event, and ensuring that all 
cells of the rosette respond while none of the 
neighbours do. It is an intriguing case of 
multicellular architecture feeding back to 
control molecular signalling directly. 


ASTROPHYSICS 


Because FGF concentrations accumulate 
only when the microlumen is topologically 
complete, the factors also provide a tempo- 
ral checkpoint to the process. It thus unites a 
group of cells both temporally and spatially in a 
coordinated all-or-nothing response. This is an 
interesting, and slightly surprising, way to use a 
highly diffusible signalling molecule, but may 
turn out to be a widely employed mechanism 
in nature. @ 
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Monster star found 
hiding in plain sight 


Massive stars are rare, but they are sources of some of the most energetic 
phenomena seen in the Universe today. A high-mass candidate has now been 
found in a star-forming region that has been observed for more than 50 years. 


DONALD F. FIGER 


he most massive stars in the Universe 

captivate the imagination of laymen 

and experts alike. They represent an 
extreme form of star and produce outsized 
effects on their environment. Although stars 
with masses greater than 20 times the Sun's 
mass comprise only about 1% of all stars in 
a young star cluster, their ionizing radiation, 
stellar winds and ejecta from supernovae 
dominate some of the most observable phe- 
nomena in the Galaxy. Massive stars are among 
the few bodies that can be seen in other galax- 
ies, and they are probably linked to the most 
massive explosions in the Universe. Finally, 
they are thought to have seeded the early Uni- 
verse with heavy elements (those heavier than 
helium), which are now seen in even the oldest 
stars. Writing in Astronomy & Astrophysics, 
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Wu et al.’ identify the next heavyweight 
contender — a star with the decidedly unsexy 
name of W49nrl. 

Wu and colleagues claim a mass for this star 
that would place it among the most massive 
known, but a sceptic might say “extraordi- 
nary claims require extraordinary evidence”. 
Indeed, astronomers have, on further inspec- 
tion, often thrown such assertions on the 
rubbish heap of history. 

This kind of claim relies on models that 
translate the amount of observed starlight into 
an estimate of the mass of the star. Generally, 
the more massive the star, the brighter it is. As 
is almost always the case, Wu et al. observe 
light from the star over only a fairly narrow 
range of wavelengths, representing much less 
than 1% of the total emitted light. It would be 
useless to convert that relatively small portion 
of the total light into a mass estimate were it 
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Diversity breeds 
complementarity 


Evolutionary and ecosystem processes have long been treated as distinct. The 
finding that interactions among plant species cause rapid evolutionary changes that 
affect ecosystem function suggests that it is time for unification. SEE LETTER P.108 


DAVID TILMAN & EMILIE C. SNELL-ROOD 


he great naturalist Charles Darwin 
proposed his theory of evolution by 
natural selection as a unifying expla- 
nation for patterns seen in the natural world. 
But the unity sought by naturalists gave way 
to more-fragmented perspectives as natural 
history itself speciated into the modern disci- 
plines of ecosystem ecology, community ecol- 
ogy, population biology, palaeontology and 
evolution. In this issue, Zuppinger-Dingley and 
collaborators’ (page 108) have taken a signifi- 
cant step towards a reunification of these disci- 
plines. Their findings in an experimental study 
of plants suggest that ecosystem and evolution- 
ary processes cannot be separated: ecological 
interactions among a large number of plant 
species can cause rapid evolutionary changes 
that, in turn, influence ecosystem processes. 
An idea central to both ecology and 
evolution is that of the niche — the set 
of environmental conditions in which a 
particular species thrives. In ecology, niche 
differences among species help to explain why 
large numbers of competing species coexist, 
and why greater plant diversity leads to greater 
ecosystem productivity’. In evolutionary biol- 
ogy, the niche concept features prominently in 
our understanding of how new species arise. 
Competition between closely related species 
drives the evolution of trait differentiation, 
such as bird beaks that are specialized for 
different seeds or lizard limbs that are suited 
for either climbing or walking. The evolution 
of such character displacement can be seen in 
laboratory experiments using microorgan- 
isms’ and in field studies of incipient species 
formation, such as in Darwin’ finches on the 
Galapagos Islands’. 
In their study of character displacement, 
Zuppinger-Dingley and collaborators made 
use of experimental field plots in which 


16 species of grassland plant were grown either 
in monocultures or in mixed plots of 4 or more 
species for 8 years. They then collected these 
plants, propagated them in the lab, and assem- 
bled the offspring in new communities: either 
monocultures or mixed communities of two 
species. They observed that, relative to the 
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monocultures, the 8-year period of selection 
in the high-diversity communities caused 
shifts in the traits of the plant species, specifi- 
cally in plant height and leaf thickness. These 
shifts were consistent with character displace- 
ment and niche differentiation (Fig. 1a). The 
researchers also observed ecosystem-level con- 
sequences of these rapid evolutionary changes: 
the mixed cultures of plants from the diverse 
communities were more productive in terms of 
biomass than were mixed cultures from mono- 
cultures. These results exemplify the emerg- 
ing field of eco-evolutionary dynamics, which 
emphasizes that not only does ecology drive 
evolution, but evolutionary change feeds back 
to affect ecological processes”. 

In Zuppinger-Dingley and colleagues’ 
study, laboratory propagation of the plants 
increased the chance that the differences 
between the high- and low-diversity selec- 
tion groups were due to genetic divergence. 
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Figure 1 | Evolutionary niche shifts. a, Zuppinger-Dingley et al.’ find that, when plant species are grown 
in acommon environment, those that have a history of selection in diverse communities develop greater 
differences in traits (such as height and leaf thickness) than species that have a history of isolation. b, This 
idea feeds into our understanding of how evolutionary history influences the ecological interactions of 
species that compete for growth factors such as soil nutrients, light and space. All species face trade-offs. 
For instance, biomass that is allocated to obtaining soil nutrients (roots) cannot be used to obtain light 
(leaves and stems) or to disperse to open sites (seeds). Graphically depicted, the resulting ‘trade-off 
surface’ (triangles) represents all possible ways in which plant species (ellipses) can allocate their biomass. 
A history of selection in diverse communities results in greater interspecific differences (less overlap of 
ellipses) and more specialization (smaller ellipses) than a history of isolation. 


44 | NATURE | VOL 515 | 6 NOVEMBER 2014 


© 2014 Macmillan Publishers Limited. All rights reserved 


However, it is possible that epigenetic factors 
— heritable changes that do not involve 
DNA-sequence changes — could have had 
a simultaneous role’. If so, this invokes a 
broader question’ concerning the influence 
of developmental plasticity in niche differ- 
entiation: might character displacement in 
a diverse community initially be driven by 
developmental responses to resources and 
competitors that are later genetically assimi- 
lated as speciation occurs? The present study 
suggests that this challenging question could 
be addressed in real time with an experimen- 
tal, field-based approach. 

Because natural communities are diverse, 
selective forces that emerge from interactions 
between many species may be unexpectedly 
influential factors that shape species traits. 
How might we conceptualize this possibil- 
ity? Stable coexistence requires both character 
displacement and evolutionarily unavoidable 
trade-offs between species*”. Such interspecific 
trade-offs occur only if allocation of biomass 
to traits that increase performance in one 
type of environment decreases performance 
in other environments. For example, plants 
that have greater root mass perform better 
in infertile soils, but those that have more leaf 
and stem mass — and thus are taller and cap- 
ture more light — dominate fertile soils. Such 
trade-off ‘surfaces’ could explain how the eco- 
logical interactions that allow multi-species 
coexistence also influence the rate and pattern 
of species formation*” (Fig. 1b). 

Although the disciplines of ecosystem 
ecology and evolution have developed their 
own perspectives, if each incorporated ele- 
ments of the other, both disciplines would be 
strengthened. It is time for a reunification of all 
of the branches of natural history in a renewed 
search for unified explanations of the patterns 
seen in the natural world. = 
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Shape control in 
reactions with light 


The report of a light-activated catalyst that dictates the three- dimensional shape 
— the stereochemistry — of molecules formed in an organic reaction suggests a 
new strategy for controlling such reactions using visible light. SEE LETTER P.100 


KAZIMER L. SKUBI & TEHSHIK P. YOON 


a molecule absorbs light. Such reactions 

are greatly valued by organic chem- 
ists for their ability to promote fascinating 
changes in molecular structure that cannot 
be replicated in any other way. However, the 
application of these reactions for syntheses 
has long been hindered by several practical 
limitations. One of the biggest is the dearth of 
effective strategies for controlling the three- 
dimensional shape of the organic molecules 
produced. On page 100 of this issue, Huo et al." 
report an elegant approach to address this 
long-standing challenge. 

The interaction between light and matter 
constitutes one of the most active areas of scien- 
tific research. This year, for instance, the Nobel 
prizes for physics and chemistry were awarded. 
for the development of efficient light-emitting 
devices and for the use of fluorescence in ultra- 
high-resolution microscopy, respectively. From 
energy science to biomedicine to materials 
engineering, photochemistry is a vibrant theme 
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of such research, transecting many fields. 
Photochemical reactions have been used 
to streamline complex syntheses and to build 
structurally unusual organic frameworks. How- 
ever, organic molecules are generally transpar- 
ent to visible light — they cannot absorb its 
energy for use in chemical reactions. Organic 
photochemical reactions have typically needed 
ultraviolet light, which requires specialized 
equipment and instrumentation capable of 
handling high-energy ultraviolet photons. This 
has limited the study of photochemical synth- 
esis to a fairly small community of specialists. 
But in the past several years, a variety of 
exciting photoreactions have been developed 
that use visible light, and so can be carried out 
with simple household light sources or even 
sunlight’. The key insight was that certain 
transition-metal complexes (typically based on 
ruthenium or iridium) that absorb relatively 
low-energy wavelengths of visible light can 
be used as catalysts to activate a wide range of 
organic substrates, thereby enabling new reac- 
tions to take place. Although this development 
has fuelled renewed interest in photochemical 
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Figure 1 | Light-controlled stereoselectivity. Huo et al.’ report a general reaction in which a light- 
activated iridium catalyst controls the stereochemistry of the product. In this example, an acyl imidazole 
forms an enolate (blue), which in turn forms a complex with the catalyst (red). Bonds shown in bold 
project above the plane of the page, whereas hashed bonds project behind the page. The catalyst also 
converts a-bromo-2,4-dinitrotoluene (a benzyl halide compound) into a radical intermediate and a 
bromide ion (Br ) by donating an electron (e’). The radical reacts only at the top face of the enolate, 
because part of the catalyst blocks the bottom face. This ensures that the stereochemistry of the product 
is well defined (the green group in the product projects above the page in most of the formed molecules, 
rather than below). Me, methyl; Ph, phenyl; t-Bu, tert-butyl (C(CH,),, a highly bulky group); Ir, iridium; 
Br, bromine; the dot on the radical indicates a single electron. 
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synthesis, control over the three-dimensional 
structure of the organic products has remained 
a problem. 

This is an important problem, because 
the ability to form one mirror-image isomer 
(stereoisomer) of a molecule in preference to 
the other has profound ramifications in bio- 
logical and pharmaceutical contexts: the two 
mirror-image forms often have drastically 
different physiological effects. Similarly, the 
macroscopic physical properties of polymeric 
organic materials can be strongly affected by 
the stereochemistry of their monomeric com- 
ponents. Stereoselective synthesis therefore 
remains one of the central challenges in mod- 
ern synthetic chemistry. 

Since 2009, Eric Meggers’ research group 
has been developing methods for preparing 
ruthenium and iridium complexes as single 
stereoisomers’. In studying these complexes as 
catalysts for several organic reactions, Meggers 
and co-workers have demonstrated*” that the 
three-dimensional arrangement of the com- 
plexes can be transferred with exceptional 
fidelity to the organic products that they create. 
The same research group — Huo et al. — now 
shows that these transition-metal catalysts are 
also photoactive, and that this property can 
be exploited to perform highly stereoselective 
photochemical reactions. 

As a model system, the authors chose to 
study the a-alkylation of carbonyl compounds 
—abenchmark reaction in stereoselective syn- 
thesis (Fig. 1). The iridium catalyst first binds 
to an acyl imidazole compound, creating a 
structurally well-defined enolate complex. 
Photoexcitation of this complex initiates an 
electron-transfer process that converts a reagent 
(a benzyl halide) into a highly reactive radical 
intermediate. The geometry of the catalyst 
shields one face of the planar enolate from 
reaction (the bottom face in Fig. 1) and forces 
the radical to form a bond to it preferentially 
from the opposite face. The iridium catalyst 
thus serves two distinct roles: it simultaneously 
photoactivates one component of the reaction 
(the benzyl bromide) and controls the facial 
selectivity of the other (the enolate). 

These findings will attract considerable 
attention from synthetic chemists. Photo- 
chemical activation typically produces highly 
reactive intermediates whose stereochemical 
preferences have historically proved diffi- 
cult to control’. Some of the most successful 
approaches have needed two catalysts, with 
one performing the photochemical activation 
and the other dictating the stereoselectivity of 
the reaction”*. The discovery ofa single tran- 
sition-metal catalyst that fulfils both roles is a 
crucial conceptual step forward. 

Huo and colleagues’ reaction design 
combines the previously reported, precise 
stereoselective control exerted by their tran- 
sition-metal complexes with the practical- 
ity of using visible light for photochemistry. 
Future investigations will surely build on this 


impressive result. Because the products of 
the reported reaction could also be made by 
more-conventional methods, the next step will 
be to show that the new catalytic strategy is 
applicable to other classes of photoreaction for 
unmet synthetic applications. More broadly, 
this work provides inspiration for chemists to 
further explore how photochemistry might be 
used to transform organic synthesis. m 
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Metastasis risk after 
anti-macrophage therapy 


Blocking the activity of macrophages may delay the spread of cancer. But new 
findings show that these immune cells can rapidly rebound to tumours after 
therapy withdrawal, accelerating lethal metastasis in mice. SEE LETTER P.130 


IOANNA KEKLIKOGLOU & MICHELE DE PALMA 


acrophages are immune cells that are 
M= players in our bodies’ defence 

against invading pathogens. More- 
over, they participate in organ development, 
remodelling, healing and disease’. Macro- 
phages are also found in tumours, where they 
seem to support tumour progression and 
spread by means of several mechanisms’. This 
has prompted the development of drugs that 
impair macrophage survival*’, block their 
infiltration into tumours’ or reduce their pro- 
tumoral functions’. In this issue, Bonapace 
et al.’ (page 130) report that, in mice, although 
the continuous blockade of macrophages con- 
strains tumour progression, cessation of the 
therapy stimulates them to rapidly rebound to 
the tumours, unexpectedly leading to acceler- 
ated metastatic disease. 

Monocytes — the circulating precursors 
of macrophages — enter a tumour from the 
bloodstream and subsequently differenti- 
ate into macrophages’. The recruitment of 
monocytes and their differentiation into 
tumour-associated macrophages are regu- 
lated by signalling molecules released by the 
tumour or its metastases. One of these is C-C 
chemokine ligand 2 (CCL2), a protein that 
attracts monocytes expressing the receptor for 
CCL2, called CCR2. Blocking the binding 
of CCL2 to CCR2-expressing monocytes inhib- 
its macrophage infiltration into the metastases 
that form in the lungs of mice with mammary 
tumours, delaying the progression of meta- 
static cancer and extending mouse survival’. In 
humans, both high levels of CCL2 expression 
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and macrophage infiltration in the tumours 
correlate with a poor prognosis in some can- 
cer types, such as breast cancer**. Because 
tumour metastasis to the body’s vital organs 
is the main cause of cancer-associated death, 
blocking CCL2 may be an attractive strategy to 
help combat metastasis in patients with breast 
cancer and, possibly, other tumour types. 

Bonapace et al. used a neutralizing antibody 
against CCL2 to block the protein's activity 
in mice with mammary tumours. In agree- 
ment with previous studies’, CCL2 block- 
ade decreased macrophage recruitment to 
the tumours and reduced the incidence and 
growth of the lung metastases (Fig. 1). The 
authors observed that the number of circu- 
lating cancer cells shed from the primary 
tumour, which can travel to the lung and initi- 
ate metastases, was reduced during anti-CCL2 
treatment. This suggests that CCL2 neutraliza- 
tion had a direct effect on the primary tumour, 
possibly through macrophage regulation of the 
growth and characteristics of tumour blood 
vessels’ — the first barrier encountered by 
cancer cells in their journey to distant organs. 
But the treatment may also affect the estab- 
lishment and growth of newly settled metas- 
tases, for example by inhibiting macrophage 
production of vascular endothelial growth 
factor A (VEGF-A), a protein that stimulates the 
formation of blood vessels in tumours’. 

But in a dramatic twist, the authors found 
that interrupting anti-CCL2 therapy accel- 
erated the development of lung metastases 
and death (Fig. 1). As early as 10 days after 
withdrawal of the therapeutic antibody, they 
observed abnormally increased numbers of 
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Figure 1 | Monocyte rebounds after therapy. a, The presence of macrophages 
in primary tumours is associated with tumour growth and metastatic spread 

to distant organs. (Macrophages differentiate from monocytes, which form 

in the bone marrow.) b, Bonapace et al.’ confirm previous findings that 

when mice with mammary tumours are treated with anti-CCL2 antibodies, 
this blocks the egress of monocytes from the bone marrow and impairs 
macrophage infiltration into primary tumours, leading to reduced formation 


circulating cancer cells and monocytes in the 
blood of the mice, and this was associated 
with rapid infiltration of monocytes in the 
lung and faster metastatic tumour growth. 
These pro-metastatic effects were due, at least 
in part, to the growth-promoting functions of 
the monocytes recruited to the lung, because 
removal of the primary tumour after anti- 
CCL2 therapy also led to increased metastasis, 
despite the lower numbers of cancer cells in the 
bloodstream. 

So how does cessation of anti-CCL2 therapy 
promote monocyte rebounds to the metastatic 
tumours? Bonapace and colleagues suggest that 
the mechanism may involve heightened CCL2 
levels in the lungs of the mice after therapy. 
Although this phenomenon could be due to the 
stabilization of CCL2 in complex with the neu- 
tralizing antibody, there is clinical evidence that 
free CCL2 levels surge in patients with cancer 
who are treated with the human anti-CCL2 
antibody carlumab, as early as one week after 
the first antibody infusion and regardless of 
subsequent infusions”. Together, these find- 
ings suggest that pharmacological targeting of 
CCL2 may trigger a feedback mechanism that 
fuels CCL2 production. In this scenario, cessa- 
tion of anti-CCL2 therapy may rapidly (albeit 
transiently) generate abnormally high levels of 
free CCL2 that foster monocyte recruitment to 
the metastatic tumours. 

Another mechanism may involve the mode 
of action of the anti-CCL2 antibody. Sys- 
temic neutralization of CCL2 does not impair 
the production of monocytes in the bone 
marrow, but rather blocks their mobilization to 
the circulation. Bonapace et al. observed that 
monocytes accumulated in the bone marrow 
of their mice during anti-CCL2 therapy, and 
that therapy withdrawal unleashed these cells, 
leading to their accumulation in the blood, 
lungs and tumours. Therefore, post-therapy 
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rebounds of CCL2 and circulating monocytes 
may have cooperatively contributed to increas- 
ing macrophage infiltration into the meta- 
static tumours. Once recruited en masse, the 
macrophages seemed to precipitate metastatic 
tumour growth mainly through their produc- 
tion of VEGF-A, and, indeed, pharmacological 
blockade of VEGF-A following cessation of 
anti-CCL2 therapy restored normal (non- 
accelerated) tumour progression in the animals. 
Anti-macrophage therapies are currently 
being investigated in patients with cancer, 
but have not yet received official approval for 
clinical use. Although carlumab did not show 
antitumoral activity in initial clinical trials, 
possibly because it failed to stably neutral- 
ize CCL2 in the patients’ circulation”’, other 
drugs have been developed that specifically 
target macrophages through different mecha- 
nisms. Among these are antibodies or small- 
molecule inhibitors that block the activity of the 
colony-stimulating factor-1 receptor (CSF1R), 
a signalling receptor that controls the differen- 
tiation and survival of macrophages’. Whereas 
anti-CCL2 antibodies sequester monocytes in 
the bone marrow and block their recruitment 
to the tumours, anti-CSF1R antibodies func- 
tion mainly as monocyte- (and macrophage-) 
depleting agents’, and thus are unlikely to cause 
monocyte rebounds to tumours post-therapy. 
Even so, the depletion of macrophages 
in tumours may stimulate the intratumoral 
accumulation of another immune-cell type 
endowed with protumoral functions, the 
neutrophil’. Neutrophil rebounds in macro- 
phage-depleted tumours may restore, or even 
precipitate’, tumour progression in mice. 
It remains to be seen whether macrophage- 
depleting drugs elicit similar compensatory 
responses in patients with cancer. 
Macrophages can suppress the antitumoral 
functions of T cells of the immune system’, 
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of lung metastases. c, But the authors also find that cessation of anti-CCL2 
therapy is rapidly followed by mobilization of monocytes from the bone 
marrow to the circulation and their differentiation to macrophages in 
metastatic tumours, where they promote the formation of tumour blood 
vessels and tumour growth. This seems to be mediated by macrophage 
production of the growth factor VEGF-A, and by the signalling molecule 


so their transient depletion in tumours may 
increase the efficacy of immunotherapy 
designed to evoke T-cell-mediated antitumour 
responses*. Also promising are pharmacological 
approaches that can ‘reprogran’ macrophages 
from being pro- to antitumoral effector cells, by, 
for example, unleashing their ability to kill cancer 
cells or to present tumour antigens to T cells®. 

Bonapace and colleagues’ findings under- 
score both the therapeutic potential and the 
possible shortcomings of anti-macrophage 
approaches for cancer therapy. The arsenal 
of macrophage-targeted drugs is constantly 
expanding, with increasing sophistication 
and versatility in their modes of action. When 
combined with treatments against cancer 
cells and/or when their use is timed to mag- 
nify antitumoral immune responses, this fresh 
therapeutic asset should prove useful in the 
enduring fight against cancer. m 
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e are living in the Anthropocene epoch, 

We period of time in which human actions 
have a dominant influence on many of 

Earth’s physical and biological processes. These REVIEWS 

processes, which are organized into ecosystems, are, in 

turn, responsible for providing humanity with many 

essential goods and services. It is therefore important 

that we rein in our impacts so that ecosystems can 
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Implications of agricultural transitions 
and urbanization for ecosystem services 


Graeme S. Cumming’, Andreas Buerkert?, Ellen M. Hoffmann’, Eva Schlecht’, Stephan von Cramon-Taubadel* & Teja Tscharntke®* 


Historically, farmers and hunter- gatherers relied directly on ecosystem services, which they both exploited and enjoyed. 
Urban populations still rely on ecosystems, but prioritize non- ecosystem services (socioeconomic). Population growth 
and densification increase the scale and change the nature of both ecosystem- and non-ecosystem-service supply and 
demand, weakening direct feedbacks between ecosystems and societies and potentially pushing social-ecological systems 
into traps that can lead to collapse. The interacting and mutually reinforcing processes of technological change, population 
growth and urbanization contribute to over-exploitation of ecosystems through complex feedbacks that have important 


implications for sustainable resource use. 


ontemporary research suggests that humanity is over-exploiting 

the environment’, driving global climate change, eutrophication, 

degradation of ecosystems and biodiversity loss*. At the same 
time, the world’s human population is projected to grow from 7.2 billion 
people to 9.6 billion by 2050 (ref. 3). Although most agro-ecosystems 
have coped with anthropogenic pressures’, we cannot assume they 
will continue to meet our increasing demands*. Food-production sys- 
tems are now global, with attendant benefits and risks®; the diversity of 
farmed crops is declining’; and environmental degradation from agri- 
culture is widespread*’. These trends are eroding the resilience of agro- 
ecosystems to anthropogenic perturbations such as climate change®"””. 

Reconciling the demands of the growing human population with 
ecological sustainability is increasingly difficult’. The Millennium 
Ecosystem Assessment" classified ecosystem goods and services (ESS) 
into four categories: provisioning, regulating, supporting and cultural. It 
also acknowledged that ecosystems can provide or contribute to disser- 
vices, such as pathogens and floods. Subsequent analyses have generally 
focused on single services, or on ESS as outcomes of ecosystem-focused 
or food-production-focused models’. The underlying drivers of 
ecosystem degradation are, however, economic activities that are not 
themselves ecosystem-focused’* and may be separated from their own 
consequences by long socioeconomic supply chains”. ESS research has 
concentrated on ecosystems”, rather than the institutional, political and 
socioeconomic drivers of ecological change”'. Even the recognition that 
monitoring ESS requires not only ecological but also socioeconomic 
data is relatively recent” and has not yet influenced the ways in which 
important policies, such as international trade agreements and develop- 
ment goals, are designed and implemented”. 

Social-ecological systems are complex and adaptive, and attempts 
to manage them often have unintended consequences'’”*. To manage 
ESS sustainably, we need to understand the trajectories of change that 
have produced our current situation and continue to shape it; the inter- 
actions, feedbacks and trade-offs between different services and the 
social—ecological interactions that produce them; and the ways in which 
fundamental structural changes (those that require new system mod- 
els, rather than simply adjustments to existing models) occur within 
the ESS context. Developing this understanding requires us to connect 
people and ecosystems in an interdisciplinary social-ecological systems 


framework™”*. We address this challenge by proposing a simple concep- 


tual model that shows how a systems perspective on ESS, in the context 
of agricultural transitions and increasing urbanization, helps to explain 
ecological over-exploitation. 


Service shifts in agricultural transitions 

Ina sense, human history is the story of the great transition from hunt- 
ing and gathering, through farming, to the present situation in which 
less than 1% of the population (in many industrialized countries) is 
directly employed in food production” ”. In Germany, for instance, 
the average farmer in 1950 fed 10 individuals, but in 2010 he or she 
was feeding 131 people”. Despite its importance, there have been few 
attempts to develop formal social-ecological models of this transition. 
Generic systems models of ESS are also surprisingly hard to find. Most 
empirical analyses apply to individual goods or services, such as water 
flows or climate regulation”. A few published studies have quantified 
trade-offs between different ecosystem services*’~*°, but we are unaware 
of any formal, causal systems models that provide a broad overview of 
ESS across multiple categories and scales. Despite the existence of a wide 
range of land-cover change models”, and a growing interest in trans- 
forming cities for greater environmental sustainability”, the changes in 
service provision that are likely to happen during the transition from an 
agricultural society to an industrial society are poorly specified. 

If a human population is stable and most of the people depend 
directly on ESS, the feedbacks from ESS to human well-being are clear. 
Many cultures have developed rules and traditions that, under normal 
conditions, maintain their own resource base (for example, Balinese 
water temples as irrigation systems”; the protection of sacred forests 
in southwest Madagascar”; and the release of trained eagles, once they 
reach 5 years old, back to the wild by the Kazakh Golden Eagle hunters 
of western Mongolia”). Although rules for natural resource manage- 
ment are not always effective, ifa local equilibrium between resource 
use and human population size is maintained, a ‘green loop’ that avoids 
long-term degradation of ecosystems can be sustained. 

The green loop starts to break down when human populations grow 
as a result of technological change that increases food supply and life 
expectancy. Population density and infrastructure increase as urban 
settlements create alternative livelihood opportunities, provide security 
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Figure 1 | The green-loop to red-loop transition. In this transition, as the 
population grows, the red loop overwhelms the green loop to become the 
dominant regime driving the use of ecosystem goods and services (ESS). a, 
In the starting green loop, rural populations manage their local ecosystems. 
b, As the population grows, a ‘shadow’ red loop begins to develop; changes in 
socioeconomic variables, such as increased demand for food, fibre and fuel, 
lead to greater local ecosystem impacts. c, The red loop gains prominence as 


and increase economic, social and political complexity. However, urban 
dwellers typically have less contact with their primary resource base”. 
Over time, the ability of local ecosystems to supply a full range of ESS 
to growing settlements is reduced by one or more possible causes. 
First, the area required to meet the needs of each family exceeds the 
boundaries of the area that they cultivate, and the needs of the entire 
population exceed the resources that they can access directly. The local 
ecosystem cannot produce enough food, particularly during periods of 
adversity. In addition, as settlements grow, the surrounding ecosystems 
are increasingly modified for provisioning services such as food and 
water, often at the expense of other kinds of ESS. Second, increasing 
population density makes simple forms of waste disposal impractical, 
necessitating technological solutions. Third, as the settlement grows 
the logistics of access to ESS and travel time make it impractical for 
each household to extract everything that it needs from the local eco- 
system. This creates demands for trade, technology and infrastructure 
to enhance resource-use efficiency (particularly of land and water), 
and the need for specialized production roles (for example, farmer or 
blacksmith)”. Last, an institutional environment that enables specializa- 
tion and exchange, as well as the planning and maintenance of public 
infrastructure, requires individuals such as administrators, merchants 
and law-enforcers who do not contribute directly to food production, 
further distancing individuals from ecosystems. As societies find solu- 
tions to these challenges, local economies and populations grow and the 
tasks that people perform become more specialized”. These changes 
gradually transform a system in a green loop to one in a ‘red loop 
(Fig. 1) as three trends unfold. First, demands for non-ESS continue to 
increase, resulting in changes in institutions (rules, laws, policies and 
customs) and governance systems as well as the construction of infra- 
structure (housing, roads and reservoirs). Second, urban settlement 
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demand for services shifts from a need for ecosystem services to a need for non- 
ecosystem services. d, As the demand for services shifts, the red loop becomes 
the dominant driver in the flow of ecosystem services and is accompanied by 
an upscaling and specialization process that results in the gradual alienation of 
urban people from the ecosystem; the strength of the connection between the 
local ecosystem and society is heavily reduced (dashed line). This can easily 
result in over-exploitation of ecosystems and ecological degradation. 


and specialization foster technological progress, which is then pursued 
systematically. Technological progress in agriculture, specifically, can 
strengthen both population growth and urbanization*™*. Third, because 
provisioning from local ecosystems can no longer meet local demand, 
many needs that were formerly met by local ecosystems are outsourced, 
resulting in an increase in the geographic extent of supply and demand 
(upscaling). This trend is reinforced by developments in transportation 
technology and the demand for foreign products (for example, spices 
and precious metals) by a growing and increasingly wealthy population. 
Thus, a gradual transition occurs from an economy based on ESS to one 
based on non-ESS and remote extraction. In the process, the perceived 
importance of ecosystems to people decreases. The proportion of people 
who extract goods directly from ecosystems (farmers, fishers and log- 
gers) declines and their status might be reduced. During the transition 
period, which may last for decades, elements of both red and green 
loops coexist (Fig. 2). Typically, this increases socioeconomic diversity, 
spatial heterogeneity and inequity, often with the formation of spatial 
gradients in ecosystem service provision and socioeconomic variables 
related to proximity to various resources”. 

The socioeconomic dynamics that are driven by growing markets for 
non-ESS, together with upscaling, ongoing technological change and 
related acceleration of population growth, have many consequences 
for ESS. Society’s ecological footprint grows”. As people’s reliance on 
ecosystems becomes less obvious, they become less aware of ecologi- 
cal degradation and less concerned about it. They might also be too 
overwhelmed by local change to pay attention to their regional and 
global impacts”. As the connections between food production and food 
consumption (as well as feed and fuel production and use) become less 
apparent, societies unintentionally place increasing pressure on dwin- 
dling resources. In addition, the ability of people to censure others for 
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abusing natural resources declines because social interactions between 
producers and consumers weaken. 

Once the transition from green-loop to red-loop dynamics is 
underway, the red loop becomes the dominant driver of societal 
change. Institutions and actions that conserve ESS and contribute to 
their sustainability must then be negotiated in new action forums” in 
which many powerful and often competing actors, such as politicians, 
mining corporations and manufacturing industries, push to enhance 
the provision of non-ESS, often at the expense of ESS. The shift from 
green-loop to red-loop dynamics thus underpins a gradual regime 
shift” in the entire social-ecological system. 

Although the transition occurs gradually, the shift from a green to 
a red loop represents a fundamental change in system functioning 
that requires two different system models, rather than parameter 
changes within a single model. The two loops are alternate social- 
ecological states, each of which has reinforcing feedbacks that buffer 
it from change. The key slow-changing variables in the system”! are 
increasing human population and population density, which create 
amplifying feedbacks that rapidly ratchet up the demand for ESS 
and non-ESS; technological change, which accelerates population 
growth and enables a growing proportion of people to obtain their 
livelihoods in ways unrelated to agriculture; and a loss of biodiversity, 
which can lead to eventual socioeconomic collapse. 

During transition, the proportion of aggregate income obtained 
directly from ecosystems declines from high (green loop) to low (red 
loop)***’. A shift occurs from high to low relative prices for basic 
ESS (provision of staples such as wheat, potatoes and cassava) and to 
higher demand for special commodities (luxury food items such as 
a range of fruits)**. The value of sustaining and regulating services 
also increases while the value of provisioning services declines. For 
example, access to electricity and fossil fuels in cities reduces the reli- 
ance on local wood-fuel production”. Demands for cultural services 
might change with peoples’ perception of nature, and willingness to 
pay for cultural services may increase as natural landscapes become 
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Figure 2 | Detailed interactions and feedbacks during the transitional 
period between green and red loops. Basic household needs create a local 
demand for ecosystem goods and services (ESS). This may be expressed as 
direct and unregulated impacts on ecosystems, or, more typically, as ‘use 
actions’ (consumptive and non-consumptive) that are governed by rules, laws, 
policies and customs (institutions). Among use actions, those that have the 
highest ecological impacts are generally those that involve direct extraction of 
resources (for example, logging, cultivation or water extraction). Use actions 
affect the provision of ESS as well as ‘disservices’ (pathogens, crop damage 
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scarcer. For example, temples and shrines in heavily populated cit- 
ies in Japan and Thailand have found new significance as places in 
which to experience nature*’. Upscaling and increased trade in a red- 
loop population are not necessarily unsustainable if they lead to an 
equilibrium between the human population and resources at larger 
geographic extents. 

Lock-in to a red loop need not be an ‘end point’”’. Demand for 
new cultural services, such as walking trails and ecotourism, could 
lead to a reintegration of local ESS into urban economies and politics 
(although ecosystems may be species-poor by the time this occurs). 
Concerns for human health could also lead to measures to prevent 
environmental degradation. Ageing and declining post-peak human 
populations will bring new dynamics and possibly, if sufficient bio- 
diversity remains, the potential to return to more direct interactions 
with ESS. It remains unclear, however, whether efforts to ‘re-gree’ 
cities (for example, through urban rooftop gardens”) can persist as 
the human population continues to grow, and whether cities will 
become unsustainable without efforts to make them greener and 
more self-sufficient. 

The transition from green- to red-loop dynamics occurs through 
feedbacks between technological change, population growth and eco- 
system change. The resulting red loop has the potential to sustain- 
ably reconcile these forces by solving service supply and distribution 
problems. There may, however, be hurdles that prevent a successful 
transition and/or reduce the sustainability of the red loop. Over- 
consumption in the red loop and failure to regulate ecological decline 
can produce a ‘red trap. Rural poverty and ecological degradation in 
the green loop may reinforce each other, leading to a ‘green trap’ In 
both cases, systems must reorganize or they will collapse (Fig. 3). 

Tests of our model require long-term time series data for agricul- 
tural production, demography, economic developments and ecosys- 
tem change (Table 1). As a first step towards grounding the model 
empirically, we review evidence from three case studies: Sweden, as 
an example ofa green-loop to red-loop transition; the Sahel, focusing 
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or floods). The degree to which human needs are met by ecosystem services 
then affects future demand, completing the loop. The direct interactions of 
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focus is non-ESS, despite the continued importance of ecosystems for the 
community. Ongoing local and regional impacts on ecosystems are hidden 
from urban dwellers by outsourcing and infrastructure development. The two 
‘wild card’ variables (in grey), human health and regional processes, may be 
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socioeconomic surprises that can alter system dynamics. 
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on Niger, as an example of a green-loop to green-trap transition; and 
Beijing, as an example of a red-loop to red-trap transition (in the 
absence of an unequivocal contemporary example of a country in 
a red trap). 


Green loop to red loop in Sweden 

For more than 1,000 years, Sweden had low population levels and a 
dominantly agrarian lifestyle, consistent with a green-loop dynamic. It 
still has one of Europe’s lowest population densities (around 9.5 million 
inhabitants; 21 people km~), but between 1750 and 1850 the population 
doubled and its subsequent growth was much faster. Around 1870 
to 1890, population growth triggered a switch from a green loop (or 
possibly even a green trap; more than 1% of the population emigrated 
to America every year during the 1880s”) to a red loop. Rapid economic 
development, fuelled by engineering, mining, and the steel and pulp 
industries as well as internal institutional changes and a growing export 
market, took place between 1870 and 1914 (ref. 62). Two world wars 
and a global recession reduced economic growth, but gross domestic 
product (GDP) grew rapidly after 1950 (ref. 60). 

Since 1950, industries and business services have expanded, whereas 
agricultural production has remained relatively constant. Infrastruc- 
tural assets — buildings and machinery — grew by two orders of mag- 
nitude from 69 billion year-2000 Swedish kronor in 1965 to 2.5 trillion 
year-2000 Swedish kronor in 2000. By contrast, employment in agri- 
culture declined from nearly a million people in 1880 to under 50,000 
in 2000, and roughly 20% of agricultural land was removed from pro- 
duction between 1949 and 1999 (ref. 63). Sweden managed to retain 
substantial natural resources through its agricultural transition. In 2013, 
69% of the country was forested, 8% was used as agricultural land and 
only 2.8% was ‘built-up. Transport infrastructure (roads, railways, 
harbours and airports) accounted for 40% of built-up land, with total 
infrastructure occupying a greater area than residential dwellings. 

Advances in technologies and farming methods seem to be helping 
Sweden to remain sustainably within a red loop, with reductions in local 
environmental degradation and stable or increasing food production. 
According to Statistics Sweden, total household water withdrawals 
between 1995 and 2010 declined from 616,000 m’ to 576,000 m? per 
year; and for agriculture, from 137,000 m? to 99,000 m’. Nitrogen inputs 
into water bodies declined from 34,527 t in 1995 to 24,416 tin 2005 and 
for phosphorus, from 970 t to 733 t during the same period. From 1965 
to 2012, the area farmed and the numbers of individual people engaged 
in farming strongly declined, but yields per hectare of wheat (summer 
and winter) increased from 6,880 kg ha‘! to 11,110 kg ha‘ and annual 
production more than doubled from 1,039,320 t to 2,289,300 t. 

Ina red loop we expect a disconnect between people and local eco- 
systems, with negative consequences for biodiversity and ESS. The 
available evidence supports this view. The greatest losses of Swedish 
grasslands, one of the country’s most species-rich habitats, occurred 
before 1950 and created an extinction debt for habitat-specialized 
vascular plants, with species still being lost from the remaining grass- 
lands®. Extensive loss of old-growth forest dates back to a 1948 policy 
that enabled clear-cutting and over-use of herbicides in Swedish forests. 
The remaining forests of high conservation value are considered too 
small and too fragmented to meet current forest and environmental 
policy goals. The Swedish Environmental Protection Agency was 
only created in 1967, and biodiversity conservation has only been a 
nationally agreed objective of forest management since 1992 (ref. 66). 
In the Baltic Sea, the Swedish cod harvest peaked at 59,000 t in 1984, 
but had dropped to 16,000 t by 1993 as the fishery collapsed”. A review 
of the impacts of agricultural intensification on essential ESS in Swe- 
den between 1950 and 1999 (ref. 63) found that most of the measures 
indicated a loss of ESS from the Swedish agricultural landscape. These 
included a 60% decline in native pollinator abundance and 46%, 33% 
and 14% increases in the concentrations of the heavy metals mercury, 
cadmium and lead, respectively. 

Sweden has met many of its needs by upscaling, as predicted by our 
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Figure 3 | States, traps and transitions along the rural to urban 

gradient. A typical development trajectory from an agricultural (green loop) 
to industrial (red loop) society involves individual households gaining wealth 
while some level of ecosystem degradation occurs. Depending on population 
growth rates and governance, societies may grow without true socioeconomic 
restructuring (green trap) or become rich and continue to over-exploit 
ecosystems (red trap). The dashed lines indicate avoidable transitions. Both 
traps can lead to socioeconomic collapse. One of the primary challenges of 
development and policy initiatives is to shift societies from a green trap toa 
red loop (dotted line) without heavily altering consumption patterns, thus 
maintaining a relatively high individual quality of life without entering a red trap. 


Ecological degradation 


model. The ecological footprints of large cities in the Baltic Sea region 
for food and timber production and waste assimilation are more than 
565-1,130 times their combined area®. Swedish imports and exports 
increased sevenfold from 1975 to 2000, with unknown ecological 
impacts on remote locations. Internal biodiversity loss or an external 
limit to growth (such as climate change) may yet affect Sweden's econ- 
omy”. We could, however, find no obvious evidence that Sweden has 
entered a red trap. For the moment, it is an example of a shift froma 
green to a red loop that first increased and then reduced the impacts of 
the growing human population on local ecosystems. 


Green loop to green trap in the Sahel 

In Niger, millennia-old, environmentally specialized societies of pasto- 
ralists, agro-pastoralists, fishermen and traders indicate the long-term 
adaptations of people to ecosystem limitations and opportunities. In 
some parts of the pre-colonial Sahel, the wealth created as a result of 
labour division and the inter-regional trade of gold, salt and slaves led to 
the formation of cultural centres such as Timbuktu and Djenné in Mali 
between the thirteenth and sixteenth centuries”, proving the economic 
success of combined trade, regional migration and agro-pastoralism in 
successfully defying unpredictable rainfall. The existence ofa relatively 
sparse rural society in the Sahel for several thousand years suggests a 
stable green loop. 

The slave trade during the eighteenth and nineteenth centuries 
resulted in the loss of up to 3 million African inhabitants, affecting the 
workforce and cultural progress. Population recovery during the twenti- 
eth century rapidly led to a shortage of fertile land. Together with erratic 
rainfall, low soil fertility has, for centuries, limited the effectiveness of 
agricultural intensification efforts”. Shorter fallow periods have led to 
the expansion of cropping systems into ever more marginal drylands. 
The resulting large drop in per capita cereal production has required 
rapidly increasing cereal imports” (Fig. 4). Although between 1970 and 
2012 the area of harvested cereals expanded from 2.3 million to 10 mil- 
lion ha, cereal imports increased from almost zero to 340,000 t (Fig. 4). 
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Table 1| The main premises (both well proven and those for which the evidence is circumstantial) underpinning our model, and forms of 
evidence on which proof or disproof of our argument rests 


Model stage, prediction or hypothesis 


Evidence that would support the model 


Relevant data 


Relatively stable populations of low 
densities are maintained with an agrarian or 
pastoral lifestyle. 


Low population density is, or was, 
ecologically sustainable over timescales of 
centuries. 


Population increase leads to an increase in 
the number and size of cities (and/or land 
degradation and poverty). 


In cities, the proportion of household 
income from agriculture drops as the 
society enters a red loop. The agricultural 
transition divides urban (red loop) and rural 
(green loop) people. 


Once ina red loop, upscaling of production 
systems must occur to meet the food 
demands of the urban population. 


Red-loop dynamics reduce the connections 
of city dwellers to the countryside, fostering 
further ecological degradation. 


Existence of a green trap. 


Lower population density before the formation of cities. 


Low population density did not lead to degradation of 
ecosystem services (in addition to evidence of more than 
250,000 years of human existence in Africa). 


Increasing urbanization, declining per capita agricultural 
production, declining household smallholding sizes as well 
as intensification as a temporary fix, or failure of agricultural 
production to sufficiently increase to meet demand. 


Differences in household income sources between rural 
and urban dwellers, declines in proportion of income from 
agriculture (as an income source) or the increasing role of 
non-ESS. 


Upscaling, for example, greater ecological impacts on the 
surrounding countryside, impacts of urban demand on rural 
production systems and markets, and increased importance 
of trade. 


ncreasing rates or magnitudes of ecological impacts as 
urbanization levels increase, with less obvious dependency 
by city dwellers on provisioning ecosystem services, and 
increased ignorance about ecosystems (for example, where 
food comes from or what natural habitats really look like). 


Population increase is possible without urbanization (or 
the total population may grow more rapidly than the urban 


People per hectare before urbanization, showing 
evidence of stability in numbers. 


Estimates of how much land was needed for 
sustainability, for example, the number of hectares per 
household needed to maintain shifting agricultural 
system productively for more than 50 years, and proof 
that this much land was available. 


City sizes, urban population demographics and urban 
growth rates; per capita production of key food crops; 
and village, farm or smallholding sizes. 


At the microscale, household-level data on net income 
and sources of income; at the macroscale, agricultural 
production as a proportion of GDP between urbanized 
and developing countries; and data on service 
industries and government or city expenditures. 


Data on food prices, diversity and demand from city 
dwellers (compared with rural dwellers); rates of land 
conversion around cities; and per hectare production 
of crops in relation to market growth. 


Data on land-cover change and biodiversity loss 
as urbanization occurs, ideally compared with 

a dysfunctional green-loop situation (increasing 
population and declining quality of life). 


Data showing increasing rural population and 
declining per capita production. 


population). 


Existence of a red trap. 


Potential for collapse. 
trap and red-trap situations. 


Potential for shifting from a green trap toa 
red loop. 


Unsustainable consumption by wealthy societies. 


The demonstration that collapse is possible from both green- 


Urbanization and migration can provide a short- or 
intermediate-term solution to rural poverty. 


Despite arguments for the existence of red traps based 
on archaeological data, because of global upscaling 
few, or no, clear-cut contemporary examples exist. 


Archaeological evidence for social-ecological collapse 
in past civilizations, both agrarian and urbanized. 
Contemporary examples are harder to find because of 
technology and globalization. 


Data on household incomes for societies (for example, 
Gini coefficients) as they go through a transition. 


GDP, gross domestic product; ESS, ecosystem goods and services 


These trends correspond with a shift from a green loop to a green 
trap, in which poor rural populations remain enmeshed in rural poverty. 
Apart from during the two big Sahel droughts (in the early 1970s and 
mid-1980s), Niger has coped with the per capita decline of its rain- 
fed cereal production by upscaling. In our model, this indicates a shift 
towards a red loop. However, the economic basis for imports was the 
uranium boom — recently complemented by revenues from oil and 
gold — which resulted in an availability of funds without the creation of 
a full set of economic, infrastructural and institutional assets that would 
characterize a red loop. The decline in demand for nuclear fuel during 
the 1990s therefore resulted in a food crisis and political instability. 

In response to the green trap, the rural population of Niger migrated. 
In 1951, the urban population was 6% of the country’s 3.3 million 
inhabitants; by 2012 it was 17% of 16.6 million”. As people in Niger 
attempt to escape the green trap, the intensive production of vegetables 
in urban and peri-urban agricultural systems and in irrigated garden- 
ing systems of southeastern Niger has increased; for example, onion 
sales in the Maradi region increased from 26,000 t in 1961 to 370,000 t 
in 2011 (ref. 72). Imports of staple foods, largely financed by foreign 
aid, have allowed the urban population and its alienation from ESS to 
continue to grow (upscaling based on external economic resources). 
Upscaling of demand without expansion of local supply has led to fur- 
ther neglect of the rural sector, putting additional strain on ecosystems, 
leading to more ecosystem degradation, and making it increasingly dif- 
ficult to escape the green trap. For example, reliance on wood fuel from 
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the marginal shrublands that formerly surrounded the cities” has led 
to the widespread loss of vegetation cover and a decline in associated 
regulating and supporting services. 

Agricultural innovations proposed for the Sahel over the past 50 years 
have largely failed because food production is hampered by a combina- 
tion of climatic unpredictability and political neglect”*. A few exam- 
ples from Sudano-Sahelian West Africa’”*”® indicate that agricultural 
intensification with positive feedback loops to ESS is possible, in prin- 
ciple, in this region. It depends, however, on effective local policies, 
risk-reducing technologies and stable market demand for commodities 
that support farmers’ investments in agriculture (as well as curbing the 
present 3.9% per annum population growth rate). 


Red loop to red trap in Beijing 

Beijing is situated on the fertile North China Plain. It has an average 
annual precipitation of 578 mm and relies, for staple foods, on an inten- 
sive double cropping system of maize (corn) and wheat. Although the 
lack of water has limited the development of the Beijing basin area for 
centuries, and despite China’s one-child policy, the greater metropolitan 
area has grown from 9 million inhabitants in 1978 to more than 17 mil- 
lion in 2009 (ref. 77). 

Population growth, exacerbated by immigration, rapid industrializa- 
tion and changes in consumer demands, has led to an ever-increasing 
demand for water resources. Beijing's per-capita water-storage capac- 
ity of 300 m’ is 12.5% of China’s urban average and 3% of the world’s. 
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Figure 4| Development of cereal production in Niger between 1960 

and 2014. Cereal data illustrate the failed transition from a green toa red 
loop, and the resulting entry into a green trap. Cereal yield per hectare (a) 
has declined and the large expansion of the cultivated area (b) is primarily 
responsible for the increase in total cereal production (c), despite an overall 


Overuse of ESS is evident: 60% of Beijing's total water use, and 80% of 
its irrigation water, is fossil groundwater, which is unrenewable. Aver- 
age water table levels in 2000 were 8.1 m lower than in 1980 and 12.2 m 
lower than in 1960 (ref. 78). In 2005, agriculture consumed 38% of the 
total water, for industry the value was 20%, and for municipal and resi- 
dential purposes it was 39%; the latter is rapidly increasing, leading to 
fierce competition between these sectors”. Around 70% of the irrigation 
water in the North China Plain is wasted by evaporation, deep percola- 
tion or run-off”. China’s central government has now implemented 
measures, such as the use of plastic mulching and tree-crop interplant- 
ing on large areas, to enhance water conservation. 

High-intensity agricultural production in the Beijing area satisfies 
only 17% of city dwellers’ demand for grain and 31% of their demand for 
vegetables”. Heavy environmental contamination has occurred from 
uncontrolled wastewater discharge into water bodies, nitrate leaching 
from over-fertilization, and the release of gaseous pollutants and aero- 
sols into the atmosphere. An estimated 75% of urban residents in China 
live in areas in which the air quality is below the country’s own stand- 
ards; fine particle, emissions of sulphur dioxide and nitrogen oxides 
and subsequent fallout of acid rain® affect an even higher proportion 
of Beijing’s population. In 1997 the nationwide death toll from air pol- 
lution was already estimated to be 300,000 people per year®. Annual 
total aerial nitrogen deposition rates in China rose from 13 kg N ha’' 
in the 1980s to 21 kg N ha “in the 2000s, of which agricultural nitrogen 
sources contributed two-thirds”. Recent data for the Beijing area show 
annual total dry and wet nitrogen depositions of more than 90 kg ha’' 
per year®’, resulting in widespread acidification of the generally well- 
buffered surface soils of China’s croplands™. 

In recent decades, per capita income in Beijing has risen faster than 
the cost of living and the proportion of household income spent on food 
has declined. Hence, Beijing has witnessed a transition to urban life- 
styles, a growing dependence of food markets on distant ESS (upscaling) 
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increase in fertilizer use (d). Cereal production per 1,000 people (e) dropped 
slightly as the human population grew (f). This would have led to a decline 
in cereal availability per capita. However, since the 1990s, extensive cereal 
imports (g) have compensated for the shortfall, with the linked collapse of 
cereal exports (h). 


and the breakage of direct feedbacks from local ecosystems to the local 
population. So far, upscaling seems to have been a successful strategy 
for dealing with a potential red-trap situation. However, it is unclear 
whether further development in Beijing will be sustainable given ongo- 
ing declines in ESS and human well-being”. 


General implications 
Although ecosystems are the foundation on which non-ESS rest, 
demands by urban societies for non-ESS make the connections between 
humanity and ecosystems less obvious and less immediate. The social- 
ecological dynamics of ESS are strongly driven by the more general 
demands of society for non-ESS and by the changes in the scales of 
supply and demand, for both ESS and non-ESS, that accompany the 
transition from agricultural to industrial societies. The first point in 
particular has not been incorporated into the ESS literature. 

Agricultural transitions are fundamentally linked to human popula- 
tion growth’. Growing societies that attempt to remain ina green loop 
will almost inevitably enter a green trap, which could result in greater 
biodiversity losses than a red loop”. Few contemporary societies exist 
ina green loop, and those that come closest to doing so are often socially 
and economically marginalized and vulnerable to external exploitation 
of their ecosystems***’. Contemporary societies that seem to have best 
navigated a balance between ecological sustainability and human well- 
being are those, such as Sweden, that have entered a red loop without 
shifting exploitation to red-trap levels. The red loop has bought such 
societies additional time, and the best-case scenario is that socioeco- 
nomic feedbacks within the red loop (for example, declining fertility, 
or simply longer inter-generational times and smaller families) could 
reduce population growth and ecological footprints before these sys- 
tems enter a red trap and collapse”. 

Scale is of critical importance here: the cumulative effect of many 
local or regional red loops may be a global trap, for example if their 
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combined greenhouse-gas emissions trigger climate change. We would 
expect to find scale dependencies in the relative importance of differ- 
ent links in the model. At local extents, questions of access and infra- 
structure development may dominate red-loop ecological impacts. At 
national extents, the model may capture the basis of an economy as 
rural or urbanized, and upscaling can be perceived as globalization. For 
empirical analyses, we suggest an initial unit of analysis as the house- 
hold, with aggregation of household data across a range of different 
spatial and temporal scales, and institutional levels. 

Our model shares some elements with the environmental Kuznets 
curve’, which suggests that indicators of environmental degradation 
follow an inverted U-shaped curve over the course of economic devel- 
opment. We do not wish to reinstate Kuznets’ hypothesis, which has 
been criticized”””’; but the different pathways that we have identified 
explain why the environmental Kuznets curve might apply to some 
societies (such as those undergoing a green- to red-loop transition) 
and not to others (such as those that are caught in, or heading towards, 
red or green traps). 

It remains unclear whether, how and when ecological debts incurred 
during industrialization will have to be repaid. Human survival depends 
on maintaining functional, resilient ecosystems and resulting ESS 
through the bottleneck of maximum human population. The loss of a 
crucial proportion of Earth’s fauna during the next 50-100 years would 
be irreparable over the time frame of human existence, and future socie- 
ties may struggle to live sustainably if left with unstable, depauperate 
life-support systems. 

Our model has some parallels with existing research on traps and 
transformations”, with many relevant details and implications that 
will take time to work out. It is not directly diagnostic or prescriptive 
but it has the potential to both explain and predict, in the context of 
ESS and agriculture, the creation and resolution of scale mismatches” 
and the development of various systemic syndromes (such as the reten- 
tion of perverse incentives and subsidies”, or the continued presence 
of destructive feedback loops that are almost impossible to break). The 
transition from green to red loops may also help to explain collapses 
in some past societies, providing a translation mode (in moving from 
theory to empirical, testable hypotheses) for ideas about social complex- 
ity and adaptive cycles’. A diversity of data-intensive, comparative case 
studies is needed to test and refine these ideas; developing economies 
and fast-growing cities should be particularly fertile grounds for fur- 
ther research. Ultimately, we see in these ideas the basis for a scientific 
framework that would explain why humanity’s use of ESS is, despite 
our combined knowledge and expertise, rapidly approaching planetary 
boundaries. = 
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Learning to coexist 
with wildfire 


Max A. Moritz’, Enric Batllori’’, Ross A. Bradstock?, A. Malcolm Gill’, John Handmer’, Paul F. Hessburg, 
Justin Leonard®, Sarah McCaffrey*, Dennis C. Odion’, Tania Schoennagel® & Alexandra D. Syphard? 


The impacts of escalating wildfire in many regions — the lives and homes lost, the expense of suppression and the damage 
to ecosystem services — necessitate a more sustainable coexistence with wildfire. Climate change and continued develop- 
ment on fire-prone landscapes will only compound current problems. Emerging strategies for managing ecosystems and 
mitigating risks to human communities provide some hope, although greater recognition of their inherent variation 
and links is crucial. Without a more integrated framework, fire will never operate as a natural ecosystem process, and 
the impact on society will continue to grow. A more coordinated approach to risk management and land-use planning 


in these coupled systems is needed. 


ire is unique among the natural hazards that affect human 

communities and the ecosystems on which we depend’. Although 

humans sometimes intentionally ignite and manage fires, our main 
focus is on fighting them. For other natural hazards, such as earthquakes, 
hurricanes and floods, there is much more emphasis on identifying vul- 
nerabilities and adaptations. The ‘command and control’ approach” 
typically used in fire management neglects the fundamental role that fire 
regimes have in sustaining biodiversity and key ecosystem services” ®. 
Unless people view and plan for fire as an inevitable and natural process, 
it will continue to have serious consequences for both social and ecologi- 
cal systems. 

Over the past two decades, wildfires around the world have increasingly 
affected human values (for example, lives, views or sacred environments) 
and assets (for example, damage to homes or public infrastructure) and 
ecosystem services (for example, air quality and long-term carbon stor- 
age). The growing list of negative outcomes and their financial effects have 
complex causes and consequences’. The natural range of fire sizes and 
resultant frequencies, timings and intensities — the ‘fire regime — var- 
ies greatly among ecosystems, as do the ways in which human activities 
have altered them (for example, through timber harvesting, fire suppres- 
sion, urban or agricultural encroachment, novel ignition patterns and 
invasive species). Not surprisingly, policy strategies to address wildfires 
often emphasize fuel reduction®’. However, even where strategies rec- 
ognize interacting cultural, environmental and economic dimensions 
of wildfire’, few tackle the difficult land-use issue of where and how 
humans choose to build their communities in the first place. The prospect 
of widely increasing fire activity with climate change” intensifies the need 
for anew path forward. 

Viewing fire-related problems in the context of coupled socioecologi- 
cal systems (SESs)"*, which explicitly recognize links between humans 
and their natural environments, provides insights into achieving a more 
sustainable coexistence with wildfire. We have learned a great deal about 
fire as an essential ecosystem process and the human dimensions of living 
on fire-prone landscapes. Synthesis of this knowledge through a coupled 
systems approach can highlight specific vulnerabilities and trade-offs, and 
facilitate adaptation strategies across widely varying public and private 


landscapes (Fig. 1). In this Review, we summarize research on fire-prone 
ecosystems and fire effects on human communities through the lens of 
SESs, identify links in these coupled systems, and discuss recommenda- 
tions for greater resilience. We emphasize insights from three regions 
(Fig. 2) where major fire-related losses have occurred in recent decades: 
the Mediterranean basin, the western United States and Australia. 


Socioecological systems and fire 

Sustainable solutions to most environmental problems will be impossible 
if the links and interdependencies between humans and ecosystems are 
ignored”. In the context of wildfire, the most well-developed SES research 
that incorporates this coupling concerns climate-change effects on Alas- 
kan boreal forest ecosystems and rural indigenous communities’”"®. Case 
studies in rural communities of New Zealand” and California’ also exist. 
Remarkably, a coupled wildfire SES framework has yet to be adopted 
for the more densely developed wildland-urban interface (WUI; area 
in which communities intermix with or abut natural vegetation), where 
most of the human fatalities, home losses and fire-suppression expendi- 
tures occur. 

The complexity of how wildfire operates in different ecosystems and 
how humans interact with it indicates that place-based hazards and risks 
should be addressed as a coupled SES'*””. Reframing the problem to mini- 
mize harmful effects as the climate changes and humans increasingly 
inhabit fire-prone landscapes identifies an integrated set of coupled SES 
linkages (Fig. 1). Importantly, this allows us to recognize how the geo- 
graphic context of the coupling itself contributes to impacts and losses of 
assets throughout the wildfire SES. Local characteristics of the WUL, and 
the components on either side of it, will largely determine the degree to 
which fire may be accommodated and how communities will be affected. 
The spatial scale of the coupling may also be broad in some cases, such 
as when fires compromise recreation values (for example, trail access, 
camping facilities or fishing habitat) and water supplies of distant urban- 
ized areas, or when concerns over human exposure to drifting smoke 
influence management decisions about fires that are burning relatively far 
away. Although this framing does not intrinsically address connections 
between fire and global-scale climate change mitigation’*’>”, it helps to 
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reveal geographically relevant solutions for decreasing harmful effects and 
increasing the positive benefits of fire on the landscape. The institutional 
complexity that underlies many aspects of this coupled SES framework 
— agency mandates, property rights, building ordinances, indigenous 
governance, economic subsidies and political pressures — will also feed 
into a particular set of solutions, often creating challenging constraints. 

Sustainable coexistence with wildfire is both a process anda long-term 
goal, such that policy, planning and management are adapted and refined 
through time (Fig. 1). Responsibility must be shared between govern- 
ments and the people at risk, and the approach integrates building, plan- 
ning, fuel management, suppression capability, and knowledge of fire 
and ecosystem dynamics at different scales. Coexistence with wildfire 
should ultimately allow ecologically appropriate fire regimes to operate 
on landscapes near and far from the WUL, with relatively low risks to 
people, property and resources, while also allowing us to enjoy ecosystem 
services enhanced by fire (for example, habitat maintenance, potential 
hazard reduction, natural hydrologic functioning, and carbon and nutri- 
ent cycling). This outcome should also reduce the costs of fire suppression 
and the need to put firefighters at risk. 


Fire and ecosystems 

The role of fire in different ecosystems varies by the degree of cur- 
rent landscape modification, relative to natural or historical patterns 
and processes. Some regions have large expanses of semi-wilderness 
where maintenance or restoration of certain fire regimes is crucial to 
ongoing habitat characteristics or ecosystem services (for example, 
the western United States and Australia). Here the links between fire 
characteristics and ensuing ecological effects, or fire ‘severity, are 
often emphasized. Other regions have been so completely altered for 
various human needs that what is ‘natural’ is no longer a clear con- 
sideration (for example, the Mediterranean basin). Furthermore, cli- 
matic controls on fire regimes (for example, frequency of droughts or 
high-wind events, or length of fire season) tend to dominate in some 
ecosystems, whereas local controls (for example, topography, fuel 
loads and ignitions) strongly influence others. Fire resilience is thus 
context-dependent, varying with the biophysical environment and 
desired future conditions. Accordingly, our capacity to avoid eco- 
system degradation and catastrophic shifts” (Fig. 1) depends on the 
ecosystem in question and how climate change will manifest there. 


Mediterranean basin 

Mediterranean landscapes are mosaics of various shrublands and oak- 
and pine-dominated woodlands intermixed with extensive pastures, 
cultivated lands and abandoned agricultural fields”. Despite fire’s eco- 
logical influence there’, no reference conditions exist for fire manage- 
ment or restoration, and traditional use of fire for rangeland and game 
management has strongly influenced historical landscape dynamics”. 
Pronounced biophysical and land-use gradients have recently resulted 
in contrasting fire and vegetation dynamics. The southern and eastern 
regions are subject to land over-exploitation and reduction in vegeta- 
tion cover that increases the risk of desertification and loss of ecosystem 
services. By contrast, socioeconomic drivers are increasing fire hazards 
and losses over Mediterranean Europe (northern region) owing to rural 
depopulation, increased WUI exposure and land-cover changes that are 
sometimes promoted through afforestation policies”. Most shrublands 
and woodlands in the northern region are becoming dense enough to 
support climate-driven high-intensity ‘crown fires””. 

Wildfire in European Union countries is addressed in national and 
regional forest policy plans, but consensus on fire and ecosystem manage- 
ment is lacking. In spite of large expenditures, increased preparedness and 
greater firefighting abilities, extreme fire-weather conditions have caused 
devastating fires in several Mediterranean countries”. A new framework 
to regulate and promote traditional fire practices, accommodating diverse 
territorial contexts and operational use of fire, has thus been advocated”. 
Currently limited to local management, prescribed burning is increasing 
across Europe as a tool that aims to reduce fuel loads and diminish the 
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Figure 1 | Links and pathways to resilience in coupled socioecological 
systems affected by fire. Coexistence with wildfire is strongly influenced 
by the type of natural fire regimes that operate on a given landscape, and 

the degree to which communities can reduce exposure and vulnerabilities 
there. The wildland-urban interface (WUJ) is the spatial manifestation of 
the coupling, and the most proximate scale of exposure and risk mitigation. 
To learn from and minimize the harmful effects of fire in both the ecosystem 
and the community, links between systems and scales of interactions must 
be recognized. Doing so will trigger, through research and in response to 
changing social values and political context, further adaptation and change in 
policy, planning and management. 


risk of high-intensity fires*. Modest changes to regional and national 
wildfire policies have therefore included long-term preventive actions, 
but fire management is still primarily centred on short-term fuel- and 
suppression-oriented measures*. There are concerns over the ecological 
consequences of recent fire patterns”, but human-centred fire exclusion 
generally prevails on most Mediterranean-basin landscapes. 


Western United States 

Fire management in many western US ecosystems is informed by 
research on the historical role of fire*’, especially through dendro- 
chronology” and landscape reconstructions”. Before modern man- 
agement, different types of fire occurred among vegetation types and 
maintained important natural structures and functions, with great 
variation geographically>? >. 

In western US forests, high-severity fires that kill overstory trees 
are typical of cool, high-elevation, subalpine environments**””. 
Although severe fires may seem catastrophic from a human perspec- 
tive, in these forests they stimulate vegetation regeneration, promote 
landscape diversity in terms of vegetation types, provide habitat for 
many species and sustain other ecosystem services’. The many organ- 
isms and propagules that may survive the fire, combined with hetero- 
geneity in age, structure and species composition across landscapes, 
confer resilience against shifts to non-forest types. High-severity 
fires predominate across about 30% of western US forests, natu- 
rally mixing with low-severity fires through time and space across 
another ~45%°**. Key regional controls of high-severity fire regimes 
are extreme drought and high winds”, and local (for example, topo- 
graphic) influences on severity patterns can emerge during less dry 
conditions”. Fuels tend to be naturally abundant in these ecosystems, 
so modern fire suppression may have decreased historical levels of 
landscape fragmentation, but it has not increased fuel loads”. 

By contrast, many dry and mesic, low-elevation and mid-montane 
forests historically experienced more frequent low-severity fires 
that maintained relatively open forest structures of fire-resistant 
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Figure 2 | Area burned patterns and locations of fire-prone regions. The cumulative area burned between 1996 and 2012 in millions of hectares (Mha) per 
mapped cell. The western US region consists of the 11 western states in the conterminous United States (left), the Mediterranean basin (middle) contains the 
Mediterranean-climate biomes and the Australian region (right) encompasses the entire continent (see Supplementary Information). 


trees****“°, across about 25% of western US forests”’. Ignition pat- 


terns, vegetation structure and fuel amount exert a strong control on 
regimes of frequent low-severity fire, making them more sensitive 
to modern human perturbations and also more amenable to fuel- 
management techniques?“ Unlike high-severity fire regimes, 
timber harvesting and decades of fire suppression in drier forests 
have lengthened intervals, increased densities of smaller trees and 
shifted regimes of mostly low-severity fires to include more high- 
severity, stand-replacing fires. The extent to which this has happened 
is a topic of debate, raising questions about how widespread ‘mixed 
severity’ fire regimes were prehistorically*”**”. Regardless, reducing 
accumulated fuels in these forests is often a high management prior- 
ity. Only where such departures from natural fire regimes have led 
to denser, multilayered, fire-intolerant forests, however, may fuel- 
reduction treatments restore more characteristic forest structure and 
function (Box 1). 

There is a general consensus regarding the importance of fire, 
including the need for prescribed burning, to maintain native 
grasslands and open woodlands. Woody plant encroachment in 
many ecosystems with sparse tree cover, driven by a lack of fire and 
replacement of native herbivores, has reduced plant biodiversity, 
altered vegetation structure and threatened the fauna that depend 
on those habitats**“*. Fire also plays a crucial part in regeneration 
for some of the vast shrublands of the western United States, espe- 
cially California's densely urbanized chaparral ecosystems. Similar 
to high-elevation forests, fire in chaparral is stand-replacing and 
under strong climatic control (patterns of drought and extreme 
fire weather)”, meaning that fuel-reduction efforts have limited 
effect except in strategic locations**”’. Increased fire frequencies, 
due to abundant human ignitions and non-native grasses that sup- 
port rapid reburning, threaten to convert many native shrublands 
to degraded habitats**. Invasive grasses also cause very frequent 
and often large fires across parts of the Great Basin in the western 
United States**”’, driven by the ‘grass-fire cycle’ positive feedback” 
and bringing serious management challenges even to fire-sensitive 
desert ecosystems”. 


Australia 

Fire is ubiquitous in Australian ecosystems, including deserts and tropical 
forests, and a wide range of fire regimes have been mapped using remote 
sensing”. Annual pulses of relatively intense fire dominate the extensive 
savannahs of northern Australia, with less frequent, massive fires in the 
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arid zone occurring after above-average rainfall”. By contrast, large fires in 
the temperate forests of the south, although intense, are less extensive and 
also less regular (decadal occurrence). Biophysical models of fire-regime 
controls™ and analysis of trade-offs in fuel characteristics and fire types” 
confirm the primary role of climate, especially the gradient in summer 
monsoonal precipitation. Thus, fire frequencies tend to vary with latitude, 
decreasing towards the south and especially the arid interior. Most fire activ- 
ity on the Australian continent is in grass fuels and of relatively low intensity. 

Although palaeo-charcoal deposits document fire’s very long history 
in Australia®, fine-scale understanding of fire-regime variability through 
dendrochronology is generally lacking, hindering detailed perspectives 
on long-term variations in fire regimes. Comprehensive fire management 
initiatives focus on key environmental objectives, such as biodiversity 
conservation” and emissions reduction”, as a function of local context. 
Maintenance of contemporary fire regimes for biodiversity conservation 
is a priority in most regions, as opposed to the emphasis on restoration 
that dominates western US approaches. 

Australia’s productive eucalyptus forests, which can burn at very high 
intensities and low-moderate frequencies, are largely restricted to south- 
ernand eastern edges of the continent. Although these forests are char- 
acteristically Australian, their proximity to urbanized areas has probably 
fed the continent's reputation for high-intensity fire events (see “Where do 
people live?’). Debates over the degree to which fuel reduction, whether by 
mechanical or prescribed fire treatment, can alter the probabilities of high- 
intensity events” are similar to those that occur for western US forests. 

Prescribed burning in Australia is extensive, but controversial. Fuel 
reduction burning can partially reduce risk to human life and economic 
assets, although trade-offs with risks to environmental assets such as 
biodiversity and ecosystem services are not well understood’. How- 
ever, functional responses of species to fire frequencies, sizes, timings 
and intensities provide a measurable basis for predicting how ecological 


diversity will respond to management and climate change”. 


Resilience and climate change 

Ecosystem managers in the three regions covered here (Fig. 2) may have 
limited ability to alter the numbers, sizes and characteristics of fires 
occurring in different ecosystems”**”””. As already discussed, this is 
because coarse-scale climatic influences tend to control fire regimes 
in many ecosystems, especially those that are naturally prone to large 
and high-severity fires. Except under the most extreme conditions, 
fire regimes typically constrained by more local-scale controls, such 
as ignition frequencies and biomass accumulation rates, may respond 
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more strongly to prescribed fire and mechanical fuel reductions. This 
characterization of two opposing types of fire regimes is, however, a vast 
over-simplification — idealized end points along a spectrum of varia- 
tion within and between fire-prone ecosystems” — and management 
prescriptions need to somehow accommodate such complexity. Fur- 
thermore, fire-related sensitivities and responses vary among plant and 
animal species, so fire management for the persistence of one important 
group of organisms may not favour that of the others. 

The potential for climate change to cause ‘novel’ or ‘no analogue’ 
environmental conditions in some ecosystems presents new chal- 
lenges for management, policy and planning. An obvious goal is to 
have ongoing fire regimes that minimize the risk of biodiversity loss”. 
Yet, what adaptation responses are appropriate (Fig. 1) if we do not 
know how future climates and related biophysical processes will differ 
from the recent past? These uncertainties have resulted in somewhat 
similar recommendations about fire and ecosystem resilience*. 
Heterogeneity in vegetation types, stand structures and successional 
age classes at all spatial scales and environmental settings is emerging 
as a strategy for enhancing ecosystem resilience to climate change. 
This essentially facilitates diverse initial conditions for multiple future 
ecological trajectories, the most likely and successful of which will 
not be known for decades. The role of diverse topography in creat- 
ing microclimate refugia, or ‘holdouts’®, as well as in influencing fire 
sizes and severity characteristics within large fires**’, comprises the 
physical template for resilience in more mountainous regions. In eco- 
systems with a recent paucity of burning, fire management that fosters 
burning under diverse conditions may be useful for achieving this 
desired heterogeneity and reducing fuel accumulations”. Not all fire- 
generated heterogeneity is ecologically significant, however, so under- 
standing the effects of specific types of ‘pyrodiversity’ is important™. 


Where do people live? 

The WUI is the most proximate spatial manifestation of the cou- 
pling in a wildfire SES (Fig. 1). Understanding and addressing vul- 
nerabilities related to the WU] in fire-prone areas is therefore crucial 
to long-term solutions. As distances between urbanized areas and 
those protected from development decrease globally”, a growing 
WUI will expand the scope of coupling in wildfire SESs worldwide. 
Negative fire effects that were once due to distant’ fires (for example, 
the impacts of smoke on human health) will be increasingly common, 
making coexistence with wildfire much more challenging. 


REVIEW 


The current WUI of the western United States is relatively well 
characterized, with over 60% expansion since 1970 (ref. 70) and about 
70% in private ownership”. The WUI in this region also predomi- 
nantly occurs where fire severities are high”. Only 14% of private 
land in the western US WUI is developed, so substantial increases 
in human exposure to fire may occur as the remaining portions 
become populated”. Although less well characterized, there is grow- 
ing awareness of expanding WUI in Mediterranean Europe*’*”* and 
Australia’””’. 

Global systematic analyses of human settlement in fire-prone envi- 
ronments is important, but lacking”*. Coarse-scale characterization of 
how population densities relate to various fire-prone environments 
(Fig. 3) provides some insight. Although often characterized as a 
‘forest fire’ problem, western US patterns indicate that highly fire- 
prone locations with large numbers of people tend to be associated 
with sparse or no tree cover (for example, the chaparral shrublands of 
southern California); locations with both high population densities 
and denser forests exhibit the least area burned (Fig. 3, left). Australia 
exhibits greater area burned over a broader range of environments, 
with intermediate population densities being more fire-prone regard- 
less of the amount of forest cover (Fig. 3, middle). The Mediterranean 
basin is unique because the greatest area burned coincides with the 
highest population densities (Fig. 3, right), although this too occurs 
in locations with relatively low forest cover (for example, abandoned 
agricultural lands”). 

Acknowledging the diversity of the fire-prone environments and 
vegetation types where people live is important, because it has impli- 
cations for the types of fuel treatments that may or may not work to 
mitigate fire hazards within or near the WUL, and it could help to 
guide future resource allocation decisions (for example, among vegeta- 
tion removal, evacuation planning and home vulnerability retrofits)”. 
Awareness of the institutional and social diversity of different human 
communities is also important, as we discuss in the next section, 
because it influences their capacity for preparation and mitigation of 
hazards such as wildfires'*. 


Fire and human communities 

This section reviews research on how fires affect human communi- 
ties and is organized by the scale of coupling in a wildfire SES (Fig. 1), 
ranging from individuals to landscapes. Social science research on 
wildfire, primarily undertaken in Australia and the United States, 


BOX 1 


There is intense pressure on land-management agencies to reduce 
fire hazards (for example, rates of spread or flame lengths if a fire 
occurs). Treatments should be prioritized, however, where they may 
help to protect communities or reduce fuel loads in the areas that 
are most likely to experience uncharacteristically severe burns**”?. 
Mechanical fuel-reduction treatments are most suited to certain dry 
and fire-prone mesic forests**°9“"””, where thinning the density of 
smaller understory trees and removing surface fuel residues (non- 
merchantable tree tops and limbs) created by these treatments 

can reduce fire intensities and rates of spread*°. Not treating the 
additional surface-fuel by-products can actually increase fire 
intensity and severity when a wildfire does occur*?. 

Some of the most basic trade-offs that limit the widespread use 
of mechanical fuel reductions involve their economic viability. Often, 
larger commercial trees will be harvested to help offset operational 
costs, but this typically generates more surface-fuel residues. 
Moreover, opening up the overstory canopy and increasing sunlight 
penetration can increase growth of highly flammable understory 


What can ‘thinning’ of fuels achieve? 


vegetation. Controlling this growth response is an ongoing endeavour, 
the economic feasibility of which is unknown. 

Uncertainty about when and where treatments might actually 
perform as desired must also be considered. Although there are 
many examples of fuel treatments reducing fire behaviour when 
conditions are not extreme, recently treated forests can experience a 
stand-replacing crown fire when wind speeds exceed 30 kmh? and 
when fuel moisture is low!°. When the probability of fire occurring 
in a particular area is relatively low, the odds of a fuel treatment 
influencing the behaviour of a wildfire there, within the time frame 
that treatments are effective, is also low!®°. The degree of protection 
provided by a particular mechanical treatment may thus depend on 
uncertain parameters (for example, ignition patterns and extreme 
wind frequencies). 

In many areas, ecological restoration and fuel-management goals 
may be best balanced and accomplished through fire*!, which 
creates natural heterogeneity and provides for fire-dependent 
species. 
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Figure 3 | Relationship between forest cover, population density and 
area burned in fire-prone regions. Locations with both higher human 
populations and greater amounts of burning tend not to be consistently 
characterized by high forest cover. Patterns vary greatly among regions, 
reflecting the different contexts in which each side of the wildfire 
socioecological system have intersected. (Data were aggregated from 


is relatively sparse and not easily generalized. Work in the United 
States emphasizes social acceptance of techniques to mitigate fire 
risk (for example, fuel reduction on public and private lands) and, 
more recently, public response during and after fires”. In Australia, 
where many people do not evacuate during fires, risk perception, 
homeowner preparedness and response during fires, and commu- 
nity safety” are key areas of research. We also include studies outside 
the social sciences that have examined the role of vegetation and fuel 
treatments linked with losses and the built environment itself. 


Risk perception and public response 
Public response to wildfire is shaped by numerous factors, such as local 
context and individual personality and experience, so simple explanations 
for action or inaction do not exist. For instance, many researchers and 
managers assume that individuals do not understand fire risk. But US 
studies show that most people living in high-fire-risk areas understand 
their exposure, but there is a tenuous link between understanding risk and 
taking action to mitigate it; whereas recognizing risk might be necessary 
to consider mitigation, perceived efficacy of mitigation and resource con- 
straints can be more influential”. Similarly, whereas around 80% of people 
in the fire risk areas of Victoria, Australia, know they are in a hazardous 
area®', this does not necessarily translate to safer actions. After the devas- 
tating 2009 Black Saturday fires in Victoria, most people in high-fire-risk 
areas were aware of what new fire warnings meant and how to ensure their 
safety, but few acted on the knowledge when the highest-level warning 
was issued*’. A deeper understanding of the influences on preparedness, 
evacuation decisions and support for hazard mitigation is needed. 
Specific cultural and institutional systems affect public response to 
wildfire, as do psychological and social dynamics. For example, insti- 
tutional structures in the United States and Australia are quite different, 
but key social dynamics have many similarities. In both countries, trust 
is a key factor shaping public support for agencies, whether they provide 
information or engage in fire-management activities”. US studies of pub- 
lic acceptance of prescribed fire reveal that trust in the personnel imple- 
menting the burn, along with familiarity with the practice, are associated 
with higher acceptance levels**. In terms of the US public response during 
fires, evacuating has long been the norm, often with mandatory evacua- 
tion orders; until Black Saturday, Australians were urged to either prepare 
to stay and protect their properties, or to leave early, on the basis that either 
option was safer than leaving late”. Despite this difference, the range of 
public behaviours in both countries is similar, with some residents leaving 
early, some staying to defend and a substantial number waiting to see how 
the situation develops. Furthermore, individual actions do not necessarily 
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original sources (see Supplementary Information) to 0.25° resolution cells 
and plotted as density surfaces.) Forest cover is the percentage area covered 
by trees (>5 m height) per cell in 2000; population is number of people per 
cell (log transformed) in 2000; and fire is total area burned in hectares per 
cell (log transformed) between 1996 and 2012. The colour scale for fire is to 
help differentiate higher peaks in area burned. 


reflect a consistent response, as some household members may leave and 
some stay, while others go back and forth to check on property, animals or 
those who stay™. Although historically ‘stay or go’ seems to have worked 
reasonably well in Australia”, the approach was questioned after the Black 
Saturday fires, as it was widely seen to have contributed to many of the 
173 deaths. However, roughly half the people (around 3,000 households) 
in the burnt areas seemed to have stayed and defended their properties 
successfully and about halfleft, almost as the fire front was approaching. 
Most were satisfied with their decision and said they would do the same 
thing again™. Most also stated that they would like to be better prepared. 
The post-fire effort naturally concentrated on fatalities, with official 
advice after Black Saturday inquiries shifting to leaving early. 

When the public response is to evacuate, key elements to success 
include environmental conditions (especially fire-weather severity), 
patterns of roads, neighbourhoods and topography. In Australia, pub- 
lic warnings have been based on a fire-weather danger scale, which was 
revised after Black Saturday to capture the most extreme conditions, along 
with altered warning messages and advice for these extremes. There is 
some public understanding of the reclassification, but little evidence of 
altered behaviour® or understanding that weather conditions well below 
the extreme level are still dangerous. Analogous fire-weather warnings are 
issued regularly in other parts of the world, but are not standardized and 
rarely trigger evacuation orders. Similar to many regions, fatalities during 
evacuations in the Mediterranean basin tend to occur during the most 
severe weather conditions, when fires have already begun and people 
choose to evacuate too late®’; in addition, such extreme events seem to be 
on the rise”. A growing public safety challenge associated with evacuating 
people from fire-prone communities in mountainous terrain is limited 
road access. For example, housing densities are increasing in many WUI 
regions of the western United States without commensurate increases 
in the road network to support their evacuation®*. Emergency planning, 
including preparation of structures and training for those who choose to 
stay or simply cannot evacuate safely”, is thus increasingly important to 
the resilience of many communities in the regions reviewed here. 


Structures and surrounding vegetation 

To mitigate the risk of structure losses during wildfires, there is 
increasing evidence from many regions that it is best to focus on the 
house first and move outward from there”. Most structure losses are 
due to ember attack***’, when flaming or smoldering plant material 
is lofted by winds and blown inside or against the building or adja- 
cent elements, often long before the flaming front arrives. Embers 
can cause structure ignition by entering through gaps as small as 
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2 mm” or accumulating outside against flammable building (or 
surrounding) features. Once ember ignition is addressed through 
structural design or retrofitting, less prevalent modes of structure 
loss are important, such as radiant heat and flame exposure. To 
address these, both building design and surrounding vegetation 
management are normally considered in unison”, with the balance 
of these treatments being site specific. Similar to evacuation success, 
an understanding of the local fire-weather conditions and expected 
types of fires is required’. Hence, the building design strategy is 
to either consider all possible extremes and the weakest link in the 
system® or to pick a threshold level beyond which the structure 
may not survive. By relating these to a corresponding fire-weather 
severity, the occupant has the information for deciding when it is 
necessary to leave early. As a contingency, egress paths from the 
building interior to another building or area of minimal fuel could 
improve safety, but preparation for such a fallback is needed long 
before a wildfire arrives. 

Vegetation reduction is most effective immediately adjacent to 
structures**”?""4, as it can eliminate the most immediate sources of 
combustible material. Vegetation overhanging the structure”! and 
ornamental plants” have been strongly associated with structure 
loss. Vegetation clearances more than about 30 m away, however, 
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seem to provide no significant additional benefit in shrubland envi- 
ronments of southern California, even on steep slopes”, reflecting 
an important trade-off between hazard reduction and habitat values 
(for organisms dependent on the vegetation removed). Although 
these findings may only apply to similar shrubland environments, a 
similar distance to heavily vegetated areas has also been identified 
for some forested environments, based on radiant heat exposure to 
structures”””’. In Australia, however, a distance from forest edges of 
more than 30 m was found to influence home losses”, indicating that 
this buffer distance may vary substantially (for example, with fuels, 
weather and construction types). Another key reason to reduce veg- 
etation near the home is to provide a relatively safe place to engage in 
structure protection, in case home owners or firefighters are present. 
It is notable, however, that some species of well-maintained trees 
(litter removed and high foliar moisture) near the home can actually 
provide protection, screening embers” and acting as a heat sink” for 
an approaching wildfire. 


Landscape-scale patterns 

Although fuel treatments seem to provide the greatest protection when 
located near human communities’”**?**””, landscape-scale charac- 
teristics of the WUL itself are important. For this reason, a long-term 


BOX 2 


Regardless of the surrounding ecosystem conditions, all communities 
can better coexist with fire by taking several steps: retrofitting 
homes against ember attack, effectively managing fuels around 
homes, developing household and community plans for evacuation 
compared with stay-and-defend decisions, and participating in risk 
awareness continuing education. For existing high-hazard wildland- 
urban interface (WUI) areas, landowners may need to take primary 
responsibility for pursuing the optimal combination of adaptation 
measures, based on their local vulnerabilities and wildfire exposure. 
For development of new communities in high-hazard WUI areas, 
governments need to take a leadership role in planning. Regardless 
of responsibility, however, all of these efforts will be guided by better 
mapping of the fire hazard itself. 

The fire hazard severity zone (FHSZ) maps (Box Fig.) of California 
are an official product of the state Department of Forestry and 
Fire Protection based on a consistent statewide methodology for 
estimating potential fire behaviour under a set of relatively dry and 
high wind conditions. Variables that affect modelled fire behaviour 
include local topography and potential fuel loads, although weather 
conditions in the current iteration of maps are not tailored to local 
extremes. Future updates to the FHSZ methodology will incorporate 
locally varying wind patterns, better reflecting conditions that cause 
the worst fire-related losses of lives and homes*°*8. 

Fire-resistant residential construction standards are determined 
by the FHSZ rating of the location in question. In addition, FHSZ 
classifications must be disclosed at the time of home sales; although 
this may not deter a sale, it can affect the cost of insuring the home 
against fire losses. FHSZ maps are thus an incremental but important 
step towards treating fire like other natural hazards (for example, 
land-use restrictions associated with flood-plain and earthquake 
fault maps). Similar mapping methods and codes are produced in 
Victoria, Australia. Such maps do not explicitly restrict development 
from occurring — a constraint that should be considered in extremely 
hazardous locations. Comprehensive approaches should, however, 
help to better design communities within a complex matrix of 
both risk and resilience that such maps could reflect spatially. (See 
Supplementary Information). 


Adaptation measures and fire-hazard mapping 
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approach involving land-use planning offers great potential for reduc- 
ing wildfire impacts in human communities. A greater understanding 
is needed concerning building configuration in the WUI and how it 
relates to risk of losses and fatalities in various environments”. In 
some shrubland-dominated landscapes, the arrangement and loca- 
tion of homes have been the most important factors for explaining 
structure loss: landscape factors such as low housing density, isolated 
clusters of residential development and long distances to major roads 
are better predictors of house loss than local factors such as defensible 
space, fuel or terrain”*”*. Whether these findings apply to fire-prone 
landscapes in general or whether there are variations between devel- 
opment patterns and fire regimes needs further research. Although 
isolated clusters of development and low housing density mean that 
homes are embedded within, and more exposed to, a matrix of wild- 
land vegetation”, ignition-prone homes that are closely spaced in 
neighbourhoods can also facilitate the spread of house-to-house fire, 
especially during extreme fire weather. 


Achieving a sustainable coexistence with wildfire 

A coupled SES view of wildfire highlights the variation in each half 
of the SES, as well as how they come together at the WUI, to create 
many permutations of hazards and vulnerabilities for both human 
and natural systems. As such, there will be different thresholds for 
how harmful effects trigger action before, during and after wildfires, 
and competing societal pressures will influence the degree to which 
scientific findings are able to guide adaptive responses (Fig. 1). Despite 
such complexity, some priorities for future work emerge from the 
extensive research reviewed here. 

Context-specific and place-based approaches will be needed to 
address many existing and future coupled wildfire SES problems. 
This is because certain fire regimes are inherently more amenable to 
management activities than others, and also due to the institutional 
and social diversity that influences human capacity for mitigating risks 
to individuals and their communities. It is possible, however, that the 
permutations mentioned above collapse into characteristic typologies 
that could inform more systematic analyses. If so, are there mutually 
resilient combinations that are well matched or somehow compat- 
ible? Some fire regimes might dictate the degree to which evacua- 
tions should be mandatory or how resources might be allocated (for 
example, training homeowners to protect homes compared with fuel 
reduction or structure retrofits). A deeper understanding of the vari- 
ation, links and scales of causes and effects in coupled wildfire SESs 
is therefore vital. 

Governments have a primary responsibility in the long-term evolu- 
tion of the WUI and the degree to which it limits or amplifies trans- 
boundary threats in coupled wildfire SESs, so much greater attention 
to land-use planning is warranted. Land-use regulations to guide fire- 
related building codes (Box 2) or restrict development in the most fire- 
prone locations””*””"™ are clearly important steps that government 
agencies could take to manage the coupling in a wildfire SES. Agencies 
have a deeper role, however, in the growth of these trans-boundary 
threats. For example, the ‘safe development paradox’ applied to flood 
and hurricane protection demonstrates that making hazardous areas 
safer for human habitation in the short term actually increases the 
potential for severe losses over longer time scales’”’. Given that gov- 
ernment agencies around the world have focused on reducing fire 
hazards (for example, through subsidized fire suppression and/or fuel 
reduction), much less attention has been paid to the ways in which vul- 
nerable WUI development might have been designed from the start. 
As further development occurs and the WUI expands, so does the 
need for increased hazard reduction. A perverse consequence of the 
typical human reaction to fire — to fight it instead of accommodate 
it — thus contributes to a deepening of coupled wildfire SES problems. 

Strategically addressing threats at the WUI maximizes the poten- 
tial for both effective risk mitigation within developments and 
management for sustainable fire regimes over the broader sweep of 
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landscapes. Ultimately, trade-offs and sacrifices must be made to 
balance these competing demands, but concentration of manage- 
ment effort for risk mitigation in the WUI minimizes the area where 
adverse effects on environmental assets are likely. Better maps of 
fire hazards, ecosystem services and climate change effects are thus 
important for assessing these and other related trade-offs. Addressing 
all social, economic and environmental assets at risk will necessarily 
focus on separating those that require exclusion of fire from those 
where fires of some sort are desirable or inevitable. However, it is 
unlikely that any planning or management regime will completely 
exclude fires from vulnerable developments on many landscapes 
(considerable residual risk to people and property will endure). The 
capacity for communities to cope with the inevitability of fire, as well 
as its effects at multiple scales, will therefore be essential. 

There is a great deal of research to support better policy, planning 
and management in all aspects of the coupled wildfire SES problem. 
Viewing fire as a natural and inevitable hazard should be central to 
most solutions, so we can anticipate its important positive and nega- 
tive effects on both human and natural systems. Given that combus- 
tion is one of the most basic and ongoing natural processes on Earth, 
we must continue to learn from our experiences to achieve a sustain- 
able coexistence with wildfire. m 
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The performance and 
potential of protected areas 


James E. M. Watson’*?, Nigel Dudley’, Daniel B. Segan?? & Marc Hockings’* 


Originally conceived to conserve iconic landscapes and wildlife, protected areas are now expected to achieve an increasingly 
diverse set of conservation, social and economic objectives. The amount of land and sea designated as formally protected has 
markedly increased over the past century, but there is still a major shortfall in political commitments to enhance the cover- 
age and effectiveness of protected areas. Financial support for protected areas is dwarfed by the benefits that they provide, 
but these returns depend on effective management. A step change involving increased recognition, funding, planning and 
enforcement is urgently needed if protected areas are going to fulfil their potential. 


he past few decades have seen protected areas undergo a pro- 

nounced expansion, geographically and conceptually. The 

collective decisions of governments, publicly funded bod- 
ies and local communities have created the rapid growth of pro- 
tected areas throughout the world’; land and sea management has 
seldom changed so quickly over such a large area (Fig. 1). At the 
same time, as many natural ecosystems fragment’, the expectations 
placed on protected areas by a growing diversity of stakeholders 
have dramatically increased. Protected areas are now created not 
only to conserve iconic landscapes and seascapes and to provide 
habitat for endangered wildlife, but also to contribute to the liveli- 
hood of local communities, to bolster national economies through 
tourism revenues, to replenish fisheries and to play a key part in 
the mitigation of, and adaptation to, climate change, among many 
other functions’. Importantly, these new demands are in addition to, 
rather than as a replacement for, earlier motivations, necessitating 
trade-offs between competing objectives*”’. Although the expanded 
role of protected areas may have fuelled their establishment, their 
constantly changing focus makes them vulnerable to accusations of 
failure to achieve one or more of these objectives. 

In this Review we explore the extent to which the potential of the 
global protected area estate is being met and outline the ways in which 
performance can be improved. We first discuss how broad objectives 
for protected areas have changed over the past century and how this has 
corresponded with substantial growth in the global protected area estate. 
We provide evidence that protected areas — when they are well managed 
and targeted at threats that they can abate — can deliver a wide range 
of benefits: protecting magnificent landscapes and seascapes, achieving 
biodiversity conservation and delivering essential ecosystem services. 
But we show that the current coverage of the global protected area estate 
is still well short of meeting core 2020 targets outlined in the Convention 
on Biological Diversity (CBD)°. We explore key issues that are affecting 
protected area effectiveness and provide recent examples of what seems 
to be a backslide in commitment by some nations through the defund- 
ing, downscaling and delisting of protected areas. Given the key role of 
protected areas in many social and environmental agendas, the current 
targets set for conservation and development, and the reduced political 
commitment in some countries, we conclude this Review with a call for 
a step change in support for the global protected area estate. 


The protected area movement 

Protected areas are not a modern concept. In different forms, they have 
been around for millennia, whether they are sacred sites guarded by 
indigenous communities, ‘tapu’ areas for communal resource use in 
the Pacific island region or hunting areas set aside to benefit the ruling 
classes’. Over the past century the modern concept of protected areas 
has been developed and refined, and the global protected area estate has 
grown extremely rapidly, from a handful of sites at the turn of the twenti- 
eth century to more than 162,000 legally designated (statutory) national 
protected areas, covering more than 28.4 million square kilometres (or 
5.6% of Earth’s surface)’ (Fig. 1; see Supplementary Methods for details 
of protected area calculations). The amount of land now found in statu- 
tory protected areas comprises a total area greater than that of South and 
Central America, and the amount of sea comprises a total area greater 
than that of the Caribbean Sea, South China Sea, Mediterranean Sea and 
Bering Sea. These numbers are further augmented by many additional 
protected areas that are not included in the official tally — those estab- 
lished by local communities, indigenous peoples, private individuals, 
non-profit trusts, religious groups and even corporations® — some of 
which, such as indigenous territories in the Amazon basin, can them- 
selves be extremely large’. 

The modern protected-area movement had its nineteenth-century 
origins in North America, Australia, Europe and South Africa, where 
protected areas were mainly set up to protect spectacular natural fea- 
tures and wildlife’’, principally in areas with little potential for eco- 
nomic use". Although many were set up with the dual mandates of 
landscape and species protection and public use, it was not until around 
the middle of the twentieth century that tourism inside protected areas 
accelerated’’. In some developing countries, the income associated 
with protected areas is of national significance”; in Rwanda, tourism 
revenue from visits to see mountain gorillas inside Volcanoes National 
Park is now the country’s largest source of foreign exchange, raising 
US$200 million annually”. 

Emerging concern over environmental degradation in the last quarter 
of the twentieth century influenced motives for establishing protected 
areas’. Recognition of the importance of in situ conservation led to a 
marked expansion of the global terrestrial protected area estate in the 
1970s as countries moved to establish protected area networks where 
species and ecosystems could be conserved from the rapid changes 
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taking place elsewhere (Fig. 1). The decision to use protected areas as 
a core conservation strategy seems to be well justified: recent reviews 
have concluded that, most of the time, well-managed protected areas 
reduce rates of habitat loss (the chief threat to biodiversity’®) in both 
terrestrial’”'* and marine’””’ environments. There is also strong evi- 
dence that protected areas maintain species population levels (including 
threatened species) better than other management approaches’. 
For example, well-managed marine protected areas (MPAs) have been 
found to contain more than 5 times the total large fish biomass and 
14 times the shark biomass compared with fished areas’’, anda study of 
60 terrestrial protected areas in the tropics found that when they are well 
managed, there is a positive outcome for biodiversity”*. This protection 
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Figure 1 | Growth of the modern terrestrial and marine protected area 
estate. Growth in protected areas is aligned with a series of key events that 
have signalled an expansion of objectives over the past 150 years, starting with 
the establishment of the first formal protected area in 1864 (see Supplementary 
Table 1). New and increasingly diverse focal objectives have added to, rather 
than replaced, pre-existing objectives so that the requirements for management 
of protected areas have expanded over time. The growth was calculated using 
data obtained from the World Database on Protected Areas'. 
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also extends to species that have high financial value and are under 
intense pressure from well-organized criminal groups”. 

However, because of the rapid growth of protected areas in the latter 
half of the twentieth century, the contact that local communities had 
with them increased. At times this growth was in conflict with the needs 
of local communities and with efforts to address poverty and increase 
economic development”””*. This resulted in widespread criticism of 
management practices in some protected areas by those concerned 
with human rights. During this period, management priorities began 
to shift towards greater recognition of the rights of local communities 
regarding the governance of the areas in which they live; many protected 
areas now have management regimes that engage local communities 
and consciously seek to balance conservation with local livelihoods”. 
In addition, the part that protected areas play in poverty alleviation 
and fostering economic development in the surrounding communi- 
ties gained increased attention, and there is now evidence of positive 
contributions in many regions**””. 

During the past two decades we have witnessed a greater emphasis 
on the role that functioning ecosystems have in maintaining human 
societies”. Well-managed protected areas can provide crucial ecosystem 
services, including water, food security, protection of wild relatives of 
crops, maintenance of wild fish stocks and carbon storage****. As popu- 
lations urbanize, the role that protected areas have in providing clean 
water for cities is increasing: a third of the world’s 100 largest cities rely 
on protected areas as a significant source of drinking water”. Protected 
areas are also now seen as a crucial component of global climate change 
mitigation efforts”, and are likely to play an increasingly important part 
in REDD+ (reducing emissions from deforestation and forest degrada- 
tion, plus the conservation, sustainable management and enhancement 
of forest carbon stocks) schemes. 

Although there is a strong global consensus within the conservation 
community that the principal role of protected areas is nature conser- 
vation”, in practice they are expected to make much wider ecological, 
social and economic contributions to human society. It is not yet clear 
what the overall impact of these increasing demands on protected areas 
will be or whether additional demands will arise in the future’. 


Coverage and targets for terrestrial protected areas 
As of April 2014 (ref. 1), the official global portfolio of nationally des- 
ignated terrestrial protected areas numbered 155,584 and covered 
18.4 million km’, or 12.5% of the terrestrial realm (Supplementary 
Methods). This is still well short of the current CBD target of 17%°,a 
figure that has grown from 10% since the 2000-2010 strategic plan, but 
is still a political compromise that many conservation scientists believe 
is too low”'. The shortfall is larger still when you consider CBD guid- 
ance that states protected areas should target places of “importance for 
biodiversity” that are “ecologically representative”®. At broad ecologi- 
cal scales, coverage markedly varies between major terrestrial biomes 
and ecoregions”. Using the latest available data’, we found that only 
300 terrestrial ecoregions (36%) have more than 17% coverage, with 
237 regions (29%) having less than 5% coverage and 68 (8%) having less 
than 1% coverage (Fig. 2a). When finer-scale analyses are conducted to 
assess whether protected areas are being placed in areas important for 
conserving species, the same patterns of variability occur. Among key 
biodiversity areas (KBAs)*’, only 28% of Important Bird Areas (sites 
identified as crucial for bird biodiversity) and 22% of Alliance for Zero 
Extinction sites (sites that hold more than 95% of the global population 
of an endangered species“) are adequately covered by existing protected 
areas’. A recent global analysis of all threatened birds, amphibians and 
mammals (n = 4,118) found that 17% are not found ina single protected 
area and 85% do not have sufficiently large populations in protected 
areas to give thema reasonable chance of long-term survival®. In com- 
parison, a decade ago 20% of globally threatened terrestrial birds, mam- 
mals and amphibians were not found in a single protected area and 89% 
were inadequately represented”. 

In the past, the patchy representation of species and ecosystems in 
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protected areas has often been attributed to weaknesses in planning 
methods”. But these new analyses show that significant additions to 
the terrestrial protected area estate over the past two decades (Fig. 1) 
have not significantly lessened biases towards higher elevations, steeper 
slopes, and lands of lower productivity, lower economic worth and low 
human density. Progress in achieving ecological representation has 
almost come to a stop, and this is likely to have serious ramifications 
when it comes to threats such as climate change”. 


Coverage and targets for marine protected areas 

Progress towards the CBD’s target of 10% of coastal and marine areas 
being protected® has been much slower than progress towards their 
terrestrial equivalent (Fig. 1). The goal is also much less ambitious 
than the 15-20% target proposed by non-governmental organizations 
(NGOs) and other groups attending the 2010 CBD Conference of the 
Parties. The tradition of open access to marine systems is a possible 
reason why it has taken longer for MPAs to be embraced than terrestrial 
protected areas. However, growth rates have been increasing since the 
World Summit on Sustainable Development in 2002 (ref. 51), and it is 
estimated that the 7,318 MPAs cover about 10.1 million km’ (or 3%) of 
the world’s marine environments’. Most designated MPAs have been 
restricted to national waters, with MPAs now covering about 6.6% of 
delimited exclusive economic zones globally~. The recent trend towards 
establishing very large (greater than 100,000 km’) MPAs is accelerating 
the expansion of global coverage: a recent analysis found that ten exist- 
ing or under-creation MPAs account for more than 53% of the world’s 
total MPA coverage”*. Although large and remote MPAs may be impor- 
tant for maintaining functioning marine ecosystems, they will not avert 
imminent and direct anthropogenic threats in populated, coastal waters 
in which pressures on biodiversity often remain intense”’. With these 
biases in mind, it is unsurprising that only 46 (20%) of the 232 marine 
ecoregions have more than 10% coverage and 107 ecoregions (46%) 
have less than 1% coverage (Fig. 2b). To our knowledge, there has been 
no fine-scale global analysis of MPA coverage of species or important 
marine biodiversity areas; this means there is a significant shortfall in 
our understanding of protected area gaps in the marine realm. 


Protected area effectiveness 

Beyond achieving a percentage protected area coverage in the marine 
and terrestrial realm, the latest CBD targets® also call for protected areas 
to be effectively managed. Although systematic research into protected 
area effectiveness is still in its infancy, there are global studies that point 
to a significant shortfall in effectiveness — only 20-50% of protected 
areas assessed were found to be effectively managed’. These global 
averages mask even more critical situations in some ecosystems, and 
there are now many examples of protected areas not achieving basic 
objectives. For example, a recent assessment of coral reefs within MPAs 
in the western Pacific Ocean's Coral Triangle found that only 1% were 
effectively managed”, and research examining vegetation loss in pro- 
tected areas in South Asia has shown that the trajectories of habitat 
conversion rates inside protected areas are sometimes indistinguishable 
from those of unprotected lands”. There is also evidence that threat- 
ened species populations inside some protected areas are declining”, 
including charismatic fauna such as lions (Panthera leo)”, Sumatran rhi- 
noceros (Dicerorhinus sumatrensis)® and African elephants (Loxodonta 
africana)”. 

Even some globally renowned protected areas, formally designated as 
UNESCO (United Nations Educational, Scientific and Cultural Organi- 
zation) World Heritage Sites, have been shown to have experienced 
serious ecological degradation, partly due to poor management effec- 
tiveness. Australia’s Great Barrier Reef Marine Park, for example, has 
experienced significant degradation, with large declines in coral cover 
in less than 30 years, as well as substantial declines in species popula- 
tions, and habitat condition and extent across large areas of the par S69: 
Similarly, Ecuador’s Galapagos National Park and Marine Reserve — 
one of the best-known protected areas in the world — has widespread 
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Figure 2 | Percentage of each terrestrial and marine ecoregion represented 
in the 2014 protected area estate. a, The shortfall in protected area coverage 
of terrestrial ecoregions (n = 827) relative to the Convention on Biological 
Diversity (CBD) target of 17%. As of 2014, only 300 ecoregions (36%) have 
more than 17% coverage, with 68 (8%) having less than 1% coverage and 

237 (29%) of all ecoregions having less than 5% coverage. b, The shortfall in 
protected area coverage of marine ecoregions (n = 232) relative to the CBD 
target of 10%. As of 2014, only 46 (20%) of the marine ecoregions have more 
than 10% coverage with 107 (46%) having less than 1%. (See Supplementary 
Methods for details of protected area calculations.) 


problems including alien species invasion, population collapse of 
exploited marine species™ and declines in ecosystem condition”. 
Under-resourcing of protected area management is the primary rea- 
son for poor performance in protected area effectiveness, especially in 
the developing world®. Fewer than 6% of the countries reporting to 
the CBD in 2003 indicated that resources for management of protected 
areas were adequate, and it is unlikely this number has improved sub- 
stantially”. A lack of resources affects boundary demarcation, effective 
law enforcement, natural and cultural resource management, and the 
provision of adequate park infrastructure, all of which affect protected 
area performance. Effectiveness is further undermined by poor 
governance quality and bureaucratic inefficiency in many protected 
areas, alienating stakeholders and eroding support for management 
decisions™. Political corruption and armed conflict also undermines 
protected areas in many parts of the world, rendering protection efforts 
ineffective. Addressing these issues is likely to be a pre-condition for 
successfully instituting other reforms to improve management”. 
Resources currently available for management pale in comparison 
with the challenges a protected area faces. Underlying pressures such 
as demographic growth, climate change and human consumption of 
natural resources, are all increasing, contributing to direct drivers of 
damage from encroachment, agriculture, infrastructure projects and 
timber demand”. These pressures are on the increase around many 
protected areas, with important implications for the overall effectiveness 
of the protected area — there are generally strong, positive correlations 
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between threats outside and inside protected areas’'. Mining is now 
a serious pressure, with a recent global analysis showing that mining 
activity inside protected areas collectively affects 6% of the terrestrial 
protected area estate, and mines within 10 km of parks influence 14% of 
the global estate”. In the face of mounting threats, protected-area man- 
agers are hampered by the linked problems of insufficient resources, 
lack of capacity and, in many cases, a poor understanding about how to 
address the suite of pressures. 


Declining support for protected areas 

Despite making global commitments towards increasing the size and 
effectiveness of the protected area estate within the CBD framework‘, 
there is now significant evidence that some governments are sliding 
back on their commitment to support protected areas through dispro- 
portionate funding cuts, reductions in professional staff and by ignoring 
their own policies (Fig. 3, Supplementary Table 2). If this is representa- 
tive of a global trend, many protected areas will be left seriously exposed, 
especially in the context of pre-existing levels of underfunding and ris- 
ing threats. 

Although more common in developing countries”’, inadequate 
financing of protected areas is also becoming increasingly prevalent in 
some of the richest countries, such as Australia, the United States and 
Canada, where major reductions in staffing levels and funding have 
been recently observed (Supplementary Table 2). For example, between 
2009 and 2013, the US National Parks Service's main operational budget 
for national parks fell by nearly 13% (excluding the supplemental fund- 
ing provided in response to Hurricane Sandy)”. Chronic underfund- 
ing has led to a deferred maintenance backlog estimated at between 
$9.03 billion and $13.28 billion, which has continued to build over the 
past decade”. In Canada, budget cuts have severely affected the science 
capacity of Parks Canada; ecosystem-science positions were cut by up 
to 30%, even though an expert panel, which was set up to review the 
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integrity of Canada’s national parks, had identified the need for greater 
investment in this area to redress what they identified as significant 
impairments to the ecological condition of protected areas”. 

Arguably, a more intransigent form of protected-area failure is 
government changes, through policy, that open up sites to resource 
extraction, or partial or full ‘degazettement. This practice has been 
labelled protected area downgrading, downsizing and degazette- 
ment (PADDD)”, where downgrading is the legal authorization of 
an increase in the number, magnitude or extent of human activities 
within a protected area; downsizing is the decrease in size of a protected 
area through a legal boundary change; and degazettement is the loss of 
legal protection for an entire protected area. A recent global analysis of 
543 instances of PADDD indicated that all three forms are increasing”. 

Most instances of PADDD have occurred in developing countries, 
where demographic changes and demand for land have put pressure 
on ecosystems, and where governments seek revenue from natural 
resources to meet development needs. It is not uncommon for the 
ministries responsible for mining or logging to issue leases on land or 
water that are already designated as protected. For example, in Uganda, 
active oil exploration and development is occurring inside protected 
areas, including the western portion of Murchison Falls National Park 
and inside three wildlife reserves that contain threatened species such 
as lions and Rothschild’s giraffe (Giraffa camelopardalis rothschildi)”*. 
Examples of downsizing include the World Heritage Sites the Selous 
Game Reserve in Tanzania, which was reduced to allow for uranium 
mining, and the Virgin Komi Forests in Russia where significant bound- 
ary changes to reserves such as the Yugyd Va National Park were made 
to allow mining (Supplementary Table 2). These are not isolated cases; 
a review of oil and gas concessions and protected areas in sub-Saharan 
Africa found that concessions overlapped park boundaries in 17.3% of 
International Union for Conservation of Nature (IUCN) category I-I] 
sites and in more than a quarter of world heritage sites”. 
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Figure 3 | A global portrait of different forms of decline in government 
support in terms of protected areas. Here we provide examples of recent 
(defined as the past 10 years) decisions made by governments that are leading 
to a decline in the effectiveness of protected area estates within countries. 
The blue and green areas show the spatial extent of the global protected area 
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estate as of April 2014 (ref. 1). Examples are well-documented instances 

of this retreat; it should be noted that many more examples remain poorly 
documented. More information and additional examples are provided in 
Supplementary Table 2. PADDD, protected area downgrading, downsizing 
and degazettement. 
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Full degazettement is a more unusual form of PADDD, but is also on 
the increase (Fig. 3; Supplementary Table 2): the Arabian Oryx Sanctu- 
ary in Oman was removed from the World Heritage List after the gov- 
ernment reduced the size of the reserve by 90% to allow for oil and gas 
extraction™. Developed countries are not immune to PADDD, as wit- 
nessed in Australia in recent years with the opening up of parks to allow 
industrial logging, livestock grazing, mining, recreational hunting and 
fishing*'. In the United Kingdom, a proposed high-speed rail link will 
damage or destroy ten Sites of Special Scientific Interest, government- 
recognized protected areas and nine NGO-run Wildlife Trust reserves”. 


The case for a step change 

The current biodiversity crisis would be much more severe had the 
establishment of protected areas in the last century not occurred. But 
a fundamental increase in support of the global protected area estate 
is now urgently needed if it is to fully deliver its potential. Countries 
are still well short of what they formally agreed to do in the 2020 CBD 
strategic plan®, and the resources currently provided for protected-area 
management are, for the most part, insufficient. Many countries are 
even failing to take adequate steps to maintain what they have already 
designated for protection. The increasing number of governments 
overtly or covertly decreasing resources and allowing incompatible 
use of protected areas sounds a clear alarm bell for current and future 
performance. 

This erosion of support for maintaining and growing the global pro- 
tected area estate is occurring at a time when anthropogenic climate 
change, the sudden upsurge in poaching, ‘land-grabbing’ by powerful 
businesses and increased mining activity are all making conservation 
challenges more complex. As human populations grow and pressures on 
natural ecosystems increase, more species and ecosystems are becoming 
predominantly (and in some cases, completely) confined to protected 
areas. These include large, threatened mammals such as the Asian ele- 
phant (Elephas maximus)”, the tiger (Panthera tigris)” and all rhinoc- 
eros species™, but also numerous plants, reptiles and amphibians’. At 
the same time, large-scale anthropogenic modification of natural eco- 
systems means that protected areas are now crucial for sustaining a large 
proportion of the world’s poorest people by providing them with life's 
most basic subsistence necessities — food, water, shelter and medicine’. 

A business-as-usual approach, by which most countries do not pro- 
vide adequate resources to ensure effective management of protected 
areas and undervalue the need for continued expansion of the protected 
area estate, means that the broad goals of the estate will fail. A funda- 
mental step change is needed to ensure both the current and future 
potential of the estate is met. 

First, countries need to create management regimes for existing pro- 
tected areas that ensure they are effective together with policies that 
support protected-area systems. Management outside protected areas 
can, and must, make crucial contributions to securing the future of 
biodiversity, but this is in no way a replacement for the clear benefits that 
protected areas provide, especially with respect to conserving KBAs™. 
Crucially, policies should be aligned within government so that the 
actions of ministries dealing with development, resource extraction 
and agriculture do not undermine those of ministries concerned with 
the environment and conservation. Much progress could be achieved 
simply by implementing approved policies on protected areas that have 
already been established through multilateral environmental agree- 
ments such as the CBD, the Ramsar Convention on Wetlands and the 
World Heritage Convention, as well as implementing the IUCN’s guid- 
ance on the definition, management objectives and governance types 
within protected areas”’. Adopting the legal principle of non-regression 
when it comes to environmental laws is also an important action that 
countries can take**. Another part of the step change is for countries to 
acknowledge that the widening expectations of protected area systems, 
along with growing pressures on protected areas, increase the burden 
for all those involved in management and requires improvements rather 
than declines in policy and resource support. 
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Second, countries need to invest adequately in protected areas to 
ensure that their objectives are achieved. Part of this is recognizing the 
return on investment that well-managed protected areas provide by 
conserving natural heritage and increasing the social and economic 
well-being of their citizens**’. Countries need to start quantifying 
the services provided by protected areas and recognizing the costs 
of protected-area degradation — it should be noted that many of the 
benefits will not easily be measurable in monetary terms. In Australia, 
the 2012-13 budget for the Great Barrier Reef Marine Park Authority 
was approximately Aus$50 million, but tourism to the reef was worth 
more than Aus$5.2 billion annually to the Australian economy”; this 
income is seriously threatened by the current degradation of the reef. 
In 2009, the Canadian government spent Can$800 million, but the 
contribution to the economy was Can$4.6 billion and supported the 
employment of 64,000 people”. A better understanding of the returns 
on investment would help to persuade countries of the need to provide 
resources to protected areas that better match the benefits received. In 
the Eastern Arc Mountains in East Africa, one-third of protected areas 
do not receive the minimum funds necessary to be effective reserves, 
even though it would only require reinvestment of 13% of the revenue 
raised through protected area tourism in Tanzania to fully fund the 
protected area estate™. 

There is ample evidence to justify more state support of protected 
areas. But good arguments do not always translate into large amounts 
of financial resources, and conservation is often an early casualty of 
any government funding squeeze. As a consequence, the third compo- 
nent of the step change is to accept the fact that governments will often 
not supply sufficient financial resources for protected areas and that 
there is a need to identify innovative models for ensuring protected 
area success; in other words, to encourage the wider community to take 
collective responsibility for protected areas. Non-conventional fund- 
ing sources (philanthropic contributions, and payments for ecosystem 
service mechanisms such as REDD+) have the potential to be crucially 
important future alternative funding sources, as do mechanisms such as 
offsets and ‘debt-for-nature swaps’ from the corporate sector™. All these 
options need careful appraisal and, where appropriate, more strategic 
application; we know that some of them can work in some places, but 
they are currently applied very sporadically”. In addition to broad- 
ening the funding base of protected areas, the next required change 
of approach is for a similar explosion in management collaborations. 
Building resilient social constituencies that advocate on behalf of pro- 
tected areas and biodiversity conservation requires the formation of 
coalitions across local, national and international actors (government, 
NGO, business and community groups) across the political spectrum; 
yet too little attention has been paid to the requirements of building a 
political constituency that will underpin long-lasting commitments to 
conservation”. At the site scale, volunteers can provide a substantial 
resource for managers of protected areas, filling roles ranging from 
those involving site maintenance to anti-poacher patrols, and there is 
evidence to show that properly planned voluntary efforts can be harmo- 
nized with professional activities to produce the desired management 
outcomes”. For example, partnerships between protected area agencies 
and scientists can bridge research and monitoring gaps in a mutually 
beneficial way, and citizen science can not only provide information 
for managers but also build a supportive and hopeful constituency for 
conservation” ”*, 

Finally, most countries still need to expand their protected area net- 
works to meet CBD obligations and the pressing needs outlined in this 
Review. The challenges of achieving this on an increasingly crowded 
planet should not be underestimated. It will require countries to 
embrace transparent planning frameworks to identify the new areas 
needed to achieve the objectives outlined clearly by the CBD, rigorous 
stakeholder consultation, imaginative application of a range of manage- 
ment approaches and then acting on plans and monitoring results trans- 
parently. A key element is to expand reporting beyond simply the area of 
land and seas gazetted to include ecological connectivity, management 
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effectiveness, equity, social and economic benefits, and the contribution 
of the system to conserving areas that are important for biodiversity. 

The package of responses needed for the step change is neither impos- 
sible nor unreasonable, although individual countries may struggle with 
some of the components. Fundamentally, it requires the recognition that 
protected areas are core to the future of life on our planet. Estimations of 
the annual cost of adequately managing an expanded network of marine 
and terrestrial protected areas range from $45 billion to $76 billion’, 
the lower of which is just 2.5% of the global military expenditure”. But 
adequate protection of marine and terrestrial environments is also cru- 
cial to global security. It seems sensible to invest an amount equivalent 
to a tiny percentage of global military spending to help provide security 
for humans and all other living organisms on Earth through a system of 
marine and terrestrial protected areas that is operating at its full capac- 
ity. Although we need to understand and report on the performance of 
protected areas, we also need to focus on the promise and potential they 
provide for the well-being of the planet and its inhabitants. m 
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Life cycles, fitness decoupling and the 
evolution of multicellularity 


Katrin Hammerschmidt'*, Caroline J. Rose'*, Benjamin Kerr” & Paul B. Rainey'* 


Cooperation is central to the emergence of multicellular life; however, the means by which the earliest collectives (groups 
of cells) maintained integrity in the face of destructive cheating types is unclear. One idea posits cheats as a primitive germ 
line in a life cycle that facilitates collective reproduction. Here we describe an experiment in which simple cooperating 
lineages of bacteria were propagated under a selective regime that rewarded collective-level persistence. Collectives 
reproduced via life cycles that either embraced, or purged, cheating types. When embraced, the life cycle alternated 
between phenotypic states. Selection fostered inception of a developmental switch that underpinned the emergence of 
collectives whose fitness, during the course of evolution, became decoupled from the fitness of constituent cells. Such 
development and decoupling did not occur when groups reproduced via a cheat-purging regime. Our findings capture 
key events in the evolution of Darwinian individuality during the transition from single cells to multicellularity. 


Cooperation has a central role in the evolution of multicellularity'®. 
Even under laboratory conditions, simple undifferentiated groups of 
cooperating cells readily evolve; however, such groups are often short 
lived: selection typically favours the evolution of cheats” '*. Cheats are 
cells that do not contribute towards group integrity, but nonetheless take 
advantage of the benefit that accrues from being part of a collective’. 
In the absence of cheater-suppression mechanisms, cheats may pros- 
per to the point where the integrity of any newly emerged group is 
compromised'*"'*. The problem of cheating has led to the suggestion 
that the evolution of mechanisms for cheater suppression is a critical 
step in the transition to multicellularity'’*~’. In this Article we explore 
an alternative possibility; namely, that cheats may play a critical role in 
a simple multicellular life cycle where the central problem is not cheater 
suppression, but rather controlled generation of this phenotype”. 

The evolution of simple groups of bacteria occurs repeatedly when 
populations of the bacterium Pseudomonas fluorescens are propagated 
in spatially structured microcosms’****. Such collectives—‘wrinkly 
spreader’ (WS) mats—arise by spontaneous mutations from the ances- 
tral ‘smooth’ (SM) genotype”*”*. The mutations cause WS cells to over- 
produce a cell-cell glue**~”’ that holds daughter cells together following 
cell division’*. The net effect is a cellular mat that colonizes the air-broth 
interface. Although glue production is costly to individual cells, the trait 
spreads” because the group of mat-forming cells reaps an advantage 
(access to oxygen) that is denied to individual cells’. 

The life span of WS mats is brief: selection acting on individual cells 
favours mutant types that cheat. Cheating cells are phenotypically SM 
and no longer produce adhesive glues’; nonetheless they take advantage 
of the benefit that accrues from being part of the mat. In the absence of 
any mechanism of cheater repression, cheats prosper—ultimately weak- 
ening the fabric of the mat to the point where it collapses’. 

While cheats pose a significant problem, the tension between coop- 
erating and cheating cells could fuel evolution”**”*'. Consider a newly 
emergent WS mat. In the absence of a means of collective-level repro- 
duction, the mat, like soma, is an evolutionary dead end. The emergence 
of cheats however is guaranteed. While cheats may destroy the mat, they 
also stand as a means of mat reproduction, provided they can regenerate 


the mat. The cycling between WS groups and SM cells—driven by the 
niche-constructing activities of each type’’—has the potential to gener- 
ate a primitive life cycle” (Fig. 1a, left panel) and with this the possi- 
bility that WS groups might participate, even just marginally, as units 
of selection in the process of Darwinian evolution”’. 


Embracing cheats 


To test the plausibility of the idea that SM cells might function as the 
seeds for a new generation of WS mats, we propagated mats under a 
regime in which SM cells were integral to mat reproduction (Fig. la, 
left panel) (this treatment stands in stark contrast to a regime discussed 
below in which cheats are purged (Fig. la, right panel)). Each genera- 
tion was founded by a single WS genotype (Fig. 1b). For lines to persist 
WS mats had to remain intact (viable) until the end of phase I, and be 
fecund (produce SM types). In addition, during the three-day phase II 
period, SM cells were required to transition back to WS. On completion 
of the cycle a single colony of the most dominant WS type was trans- 
ferred to a fresh microcosm ensuring new WS mats arise through a 
bottleneck and are thus re-established free of within-mat conflicts***. 

Transitioning between group and single-cell phases relies initially 
upon mutation and poses significant challenges. In an initial experiment 
in which 120 lines were required to repeatedly transition between WS 
and SM states, all lines went extinct by the sixth life-cycle generation. 
Extinction was primarily due to insufficient SM-cell production before 
day six of phase I (Fig. 1c). 

Given the prominent role of cheats and the requirement for mutation 
to transition each phase of the life cycle, the persistence of lines through 
six generations is surprising: it indicates a capacity for innovation. Such 
a capacity might, under different circumstances, provide opportunity 
for evolutionary refinement to the point where cycling through phases 
could come under developmental control. 

Most non-neutral mutations are deleterious and thus prone to even- 
tually disrupt the life cycle. Persistence is therefore likely to depend on 
viable lines having an opportunity to export their success to a new micro- 
cosm by division of the lineage. Were the splitting of viable lines to oper- 
ate concomitantly with the elimination of unsuccessful lines then selection 
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Figure 1 | Experimental regimes and extinction dynamics. a, Cheat- 
embracing (CE) and cheat-purging (CP) regimes. b, Each line is founded by a 
single WS genotype (yellow). After 6 days in phase I cells are harvested, plated 
and screened for SM colonies (red). Pooled SM colonies found phase II. 
After 3 days cells are harvested, plated and screened for WS colonies. A single 
WS colony of the dominant type founds the next generation. To avoid 
extinction, the WS mat of each line must be intact at the end of phase I and WS 
types must have produced SM cells; by the end of phase II, SM types must have 
produced WS cells. c, Persistence of lines (n = 120) and causes of extinction. 


among lineages might allow the possibility for life-cycle-enhancing muta- 
tions, which are beneficial over the longer time scale of the life cycle, to 
outrun life-cycle-disrupting mutations'’**>**. 


Adaptive evolution and fitness decoupling 
To allow for selection among lines, we took 120 microcosms, each con- 
taining a single WS mat, and divided these into 15 replicate populations, 
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Figure 2 | Evolution under the CE regime. a, Lines (n = 120) are arranged as 
15 replicate populations of 8 lines each (one replicate is depicted). The 
experiment involved a total of 2,400 microcosms and 140 days of selection. 
Extinction events provide opportunity for viable lines to reproduce. 
Reproduction allows export of a viable line to a new microcosm. Lines marked 
for reproduction were chosen at random and replacements took place 

within the same replicate. b, Fitness of derived lines and their single cell 
constituents relative to ancestral (ANC) types: line fitness increased 
significantly, whereas cell-level fitness decreased. One lineage generation was 
performed for each of three replicate fitness assays for each line. Error bars are 
s.e.m., based on n = 14. 


each composed of 8 lines (Fig. 2a). Lines that failed to complete the life 
cycle provided an opportunity for viable lines to export their success to 
anew microcosm. Upon the demise ofa particular lineage, a viable line 
within the same replicate was chosen at random and allowed to replace 
the extinct type (Fig. 2a). Extinction occurred at high frequency resulting 
in ~5 replacements per generation (per replicate). After ten life-cycle 
generations each population housed viable lines. Selection on lineage 
viability—and concomitantly fecundity—was thus central to persistence. 

To determine the course of evolution, the fitness of isolates from 
evolved lines was measured relative to ancestral types. A single WS geno- 
type representative of each population (of eight lines) was taken at the 
end of the selection period (see Methods and Extended Data Fig. 1a). 
In addition, 15 independent WS genotypes were obtained one muta- 
tional step from the ancestral SM genotype, thus providing a ‘baseline’ 
reference for ancestral fitness (see Methods and Extended Data Fig. 1a). 
All representative WS genotypes were competed against a lacZ-marked 
reference strain (see Methods and Extended Data Fig. 1b, left panel), 
allowing the competitive performance ofall ancestral and evolved types 
to be assessed against a single common genotype™. From a multi-level 
selection perspective’’, any trait may have different impacts on the fit- 
ness of cycling lines and individual cells. To address this issue, we intro- 
duce two different fitness measures. Fitness of lines was defined as the 
number of WS mat offspring left relative to the marked competitor (Ex- 
tended Data Fig. 1b, left panel); and cell fitness was assessed as the total 
number of cells contained within individual mats (irrespective of WS 
or SM type) at the end of the phase I period. Additional measures of 
cell-level performance are described below. 

Fitness of evolved lines—as determined by the ability to leave mat 
offspring relative to ancestral types—improved significantly (y” = 4.262, 
degrees of freedom (d.f.) = 1, P = 0.039; Fig. 2b and Extended Data 
Table 1a). This is consistent with an evolutionary response to selection 
and shows that when SM cells are integral to the reproduction of WS 
mats, lines not only persist, but their ecological performance improves. 

While fitness of evolved lines improved, cell fitness significantly 
decreased (t7. = 3.092, P = 0.0027; Fig. 2b and Extended Data Table 1a). 
This is a notable result: success at the level of evolving lineages has come 
at a cost to the individual cells of which the lines are composed. The 
fact that fitness of the evolved lines is no longer explicable in terms of 
the fitness of individual cells indicates that lineage fitness has become 
decoupled from individual cell fitness. This is consistent with theoretical 
predictions that during major evolutionary transitions selection shifts 
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from the lower (cell) to the higher (collective) level***. With such a shift 
arises a new kind of biological individual whose emergence is likely to 
curtail the independent evolution of lower-level entities’. 


Phenotypic traits underpinning lineage improvement 
Traits inherent in the individual cells must explain the improved repro- 
ductive capacity of lines*’”. To explore adaptations that contributed to 
increased fitness of lines, life history properties were determined rela- 
tive to ancestral types. Three replicate microcosms of each representa- 
tive genotype were destructively sampled each day throughout phase I 
and II and the frequency of each type determined. With interest in the 
possibility that selection might have tuned life history characteristics to 
suit the duration of each phase, the number of days of propagation was 
doubled in both phase I (12 days) and phase II (6 days). 

Evolved lines increased their capacity to generate the phenotype 
required for the next stage of the life cycle. This was evident in both 
phases: the new type was produced earlier and more reliably compared 
to the ancestral baseline lineages (7 = 5.442, d.f. = 1, P = 0.0197; Fig. 3a 
and Extended Data Table 1b). Moreover it was maximal at day six, sug- 
gesting tuning to the periodicity of the selection regime. Enhanced capacity 
to generate each stage of the life cycle was not explained by an increase 
in total cell density (density of the ancestral lineages was greater than 
that of the derived types F, = 51.521, P< 0.0001; Fig. 3b and Extended 
Data Table 1b). 

The level of SM occurrence in phase I, and WS occurrence in phase II, 
is strongly related to line fitness (Extended Data Fig. 2a, b) in both the 
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Figure 3 | Life history traits under the CE regime. a, Proportion of lines 
producing the morphotype required for the next phase of the life cycle (that is, 
SM from WS during phase I and WS from SM during phase II). b, Total cell 
density. Black, derived; grey, ancestral. Grey shaded panels indicate the 
extension of each phase as compared to the selection regime. Error bars are 
s.e.m., based on n = 15. *P < 0.05, using a generalized linear model (error 
structure: binomial; link function: logit) and subsequent post hoc contrasts per 
day for a; and analysis of variance (ANOVA) and subsequent post hoc contrasts 
per day for b. CFU, colony forming unit. 
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derived and ancestral types (evolved: 7 = 12.324, dfi=1, n= 14, 
P = 0.0004; baseline: y” = 22.801, d.f. = 1, n = 15, P< 0.0001; Extended 
Data Table 2). Notably, five ancestral lines with high fitness had a ten- 
dency towards early production of the next life phase of the life cycle 
(Extended Data Fig. 2a). After selection, this propensity for reliable pro- 
duction was present in the majority of lines (Extended Data Fig. 2b). In 
successful lines—those lines capable of completing the life cycle—this 
was not attributable to an increase in the number of cells in the next 
phase of the life cycle (F; = 0.589, P = 0.4431; Extended Data Fig. 3a 
and Extended Data Table 1b), nor to a change in the proportion of cells 
of the next phase (F; = 1.701, P = 0.1926; Extended Data Fig. 3b and 
Extended Data Table 1b). In fact the proportion of cells (and number 
of cells) marking the next stage of the life cycle was often higher in ances- 
tral lineages (Extended Data Figs 3a, b). 

The increase in the frequency of occurrence of the alternative cell 
type suggests that the transition rate between stages of the life cycle—a 
form of development—may have been the focus of selection. However, 
an alternative possibility is that the growth rate of cells has increased such 
that it is more likely for a given cell type (for example, SM) to reach critical 
numbers before the end of the phase in which it was generated (for 
example, phase I). Furthermore, higher cell growth could lead to a greater 
cumulative number of cell divisions and thus an increased chance of 
generating the next cell type (for example, WS during phase II), how- 
ever the abundance patterns in Fig. 3b stand in contrast to this idea. A 
simple mathematical model shows that the possibilities for producing 
cells of the opposing type via evolution of increased rates of switching 
readily outpaces any such possibilities arising from improvement in cell 
growth rate*’. Nonetheless, to see whether growth rate had changed we 
determined the maximum growth rate of SM cells. 

Compared to the growth rate of SM cells harvested from the ances- 
tral lineages, the growth rate of SM cells harvested from the derived 
CE lines did not increase (t74 = 1.527, P = 0.1315; Extended Data Fig. 4 
and Extended Data Table 1a). This leaves increase in the rate of trans- 
ition as a clear remaining explanation for the enhanced likelihood of 
detecting the new type in derived lines. 

Natural selection can operate ona trait at any given hierarchical level 
provided the trait is heritable and covaries with fitness at that level, but 
determining the appropriate level is challenging, especially during tran- 
sitions where selection is expected to act simultaneously at multiple 
levels”. To discern the level of selection at which traits are associated 
with fitness we performed a regression and correlation analysis on traits 
of lines and the single cells of which they are composed. The increased 
capacity to transition from WS to SM predicts line fitness in the derived 
lineages (Extended Data Figs 5a, b and Extended Data Table 2). More- 
over, line fitness is not related to, and therefore cannot be explained by, 
the competitive performance of single cells (Extended Data Fig. 5b). 
Increased line fitness is therefore a probable product of selection on prop- 
erties favouring transitions between life cycle phases—a developmental 
programme—rather than isolated success of cells within the phases. 


Genetic traits underpinning lineage improvement 


The capacity for the fittest lines to transition through phases of the life 
cycle was markedly more rapid and reliable in derived lines compared 
to ancestral types (Extended Data Fig. 6). Notable was the phenotypic 
similarity between recurrences suggestive of a specific switch-like mech- 
anism. The genome of the fittest lineage, line 17, was sequenced at gen- 
erations 4 (WSg) and 11 (WS,,) (Extended Data Fig. 7a). In addition, four 
independent replicate colonies of the 11-generation WS type (WS22) 
were transferred through three additional rounds of the two-phase life 
cycle culminating in four independent generation-14 WS types: the 
genome of one of these WS types (WS2s) was sequenced, plus its imme- 
diate SM predecessor (SM)7) (Extended Data Fig. 7a). 

If the life cycle is initially driven by spontaneous mutation, then seven 
mutations should distinguish generation-4 WS (WSs) from ancestral 
SM (SM,). Comparative analysis revealed five mutations in loci known 
to effect expression of WS and SM types”. Next we interrogated the 
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genome of the generation-11 WS (WS,,). This differed from the ances- 
tral SM type by 53 mutations (Supplementary Table 1). This unexpect- 
edly large number of mutations is a consequence of an elevated mutation 
rate attributable to a single nucleotide polymorhpism in mutS (A1489C, 
which leads to a Tyr497Pro substitution). 

An obvious question is whether the mutS (A1489C) mutation has a 
direct role in promoting switching. An alternative possibility would be 
that mutS increased the probability ofa different mutation that directly 
caused elevated switching and then hitchhiked with the switch-causing 
mutation. The mutant mus allele was reverted to wild type and the capac- 
ity of line-17 wild-type mutS (mutS") to pass rapidly and repeatedly 
through phases of the life cycle was checked. The mutS" line was sig- 
nificantly impaired in this capacity, indicating that switching depends 
directly on muts. 

The genome ofa single generation-14 WS (WS) was next compared 
to its immediate SM (SM37) progenitor (Extended Data Fig. 7a and 
Supplementary Table 1). Because of the mutator background, several 
mutations had fixed in the WS genome; however, of note was a frame- 
shift mutation in a tract of seven guanine residues in wspR (nucleotides 
742-748) resulting in an additional guanine in the SM type. wspR encodes 
a diguanylate cyclase and is one of 39 such genes that can, in principle, 
underpin expression of the WS phenotype (and its loss)”*”*’”. Given that 
the tract of guanine residues overlaps the active site of WspR (ref. 40) 
we considered the possibility that the mutS mutation might have turned 
wspR into a genetic switch enabling rapid and predictable transitioning 
between SM and WS through expansion and contraction of the tract of 
guanines*' in a manner reminiscent of a contingency locus. A dispro- 
portionate increase in the number of frameshift mutations in the mutS 
genotype, specifically in tracts of guanine (and corresponding cytosine) 
residues (Supplementary Table 1), supports this possibility”. 

The DNA sequence of wspR was obtained from every WS and SM 
genotype that arose from fourfold replication of the generation-11 line-17 
WS through three additional ‘expedited’ rounds of the life cycle (Extended 
Data Fig. 7a). As shown in Extended Data Fig. 7b many of the transi- 
tions between WS and SM from generation 10 onward correlated with 
expansion (wspR OFF) or contraction (wspR ON) of the guanine tract. 
In acontrol experiment, performed with line-17 mutS™, the slower and 
less reliable transition between phenotypic states did not, with a single 
exception, involve the tract of guanine residues. 

The existence of a genetic switch in line-17 is highly advantageous 
to the collective: it strengthens heritability between recurrences; inte- 
grates both phases into essentially a single entity; and constitutes a crit- 
ical first step in the emergence of differentiation*. Genome sequencing 
showed that the mutS-dependent switch arose in just a single lineage, 
but fixed in all eight populations of the replicate. Its evolution was reliant 
on earlier mutations in Wsp that preserved functionality of the pathway 
while ensuring constitutive activation of WspR”. It also depended on 
mutations elsewhere in the genome that exhausted alternate genetic 
routes to WS. Together, this set of prior mutations, in conjunction with 
mutS (A1489C), conferred special significance to the tract of guanine 
residues in wspR. While dependency on mutS might seem a dangerous 
liaison, the life cycle provides ample opportunity for purifying selec- 
tion to maintain integrity of the SM type and refine the switch. 


Purging cheats 

Reproduction of mats in the cheat-embracing (CE) regime underpinned 
significant evolutionary change. The causative factor is of central inter- 
est. While it is possible that a life cycle of two phases is key, reproduction 
via a bottleneck phase, combined with selection among lineages, may be 
sufficient. To test for such a possibility we performed a control experi- 
ment in which WS mats were propagated under a ‘cheat-purging’ (CP) 
regime. The sole difference between the CP and CE regimes is the cell 
phenotype that passes through the bottleneck: under the CP regime the 
cell passing through the bottleneck is a WS cell (Fig. 1a, right panel). 
Reproduction via the CP regime is analogous to fragmentation. 
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Figure 4 | Evolution under the CP regime. a, Lines (1 = 120) are arranged 
as 15 replicates of eight (one replicate is depicted). The experiment involved a 
total of 2,400 microcosms. Extinction events provide an opportunity for viable 
lines to reproduce. Reproduction allows export of a viable line to a new 
microcosm. Lines marked for reproduction were chosen at random and 
replacements took place within the same replicate. b, Fitness of derived lines 
and their single cell constituents relative to ancestral (ANC) types: both line and 
cell-level fitness increased significantly. One lineage generation was performed 
for each of three replicate fitness assays for each line. Error bars are s.e.m., 
based on n = 15. 


Conventional wisdom predicts that the CP regime will have the great- 
est evolutionary potential. Indeed, after 10 generations of lineage selec- 
tion (Fig. 4a) fitness of evolved lines improved significantly (y* = 15.737, 
df. = 1, P< 0.0001; Fig. 4b and Extended Data Table 1a). However, this 
was accompanied by a similar improvement in the fitness of single cells 
(tg6 = 2.132, P = 0.036; Fig. 4b and Extended Data Table 1a). Improve- 
ment in the fitness of evolved lines can be explained solely by improve- 
ment in individual cell performance. The striking response observed 
under the CE regime can therefore be attributed to a life cycle of alter- 
nating phases. 

To explore adaptations of the CP regime that contributed to increased 
lineage performance, life history properties were determined relative to 
ancestral types, as for the CE regime. No increase was seen in the capacity 
for WS types to transition to SM. Indeed, a significantly lower propor- 
tion of the derived groups produced SM Cm = 8.199, df = 1, P= 0.0042) 
and SM types took longer to arise (Extended Data Fig. 8a), hinting at the 
possibility that cheater suppression might have begun to evolve under 
this regime (Extended Data Figs 8b-d). 

Under the CP regime enhanced line fitness in the derived lineages is 
explained by changes in traits that improve the competitive ability of 
individual cells (Extended Data Figs 5c, d and Extended Data Table 2). 
Enhanced line fitness under the CP regime can be viewed as a by-product 
of selection at the lower (cell) level. The rate of transition between WS 
and SM correlates negatively with cell fitness parameters and is not asso- 
ciated with the CP-line fitness. In contrast to the CE regime, in which the 
WS to SM transition rate increased in a manner interpretable as a con- 
sequence of selection at the level of collectives, this trait decreased in the 
CP regime consistent with selection operating at the cell level. Evolu- 
tion of this trait in opposing directions can be viewed as resulting from 
selection at different levels. 


Perspective 

Multicellular organisms are descendants of once free-living cells 
By virtue of their capacity for differential reproduction, ancestral free- 
living cells were units of selection’’. During the transition to multicellu- 
larity, collectives of cells emerged that came to participate in Darwinian 
processes in their own right**"!””. The essential ingredient was a means 
of collective reproduction*”’. This most seminal of Darwinian proper- 
ties emerges afresh at each transition and requires explanation™. Here 
we have shown that cheating cells—those types seemingly most detri- 
mental to the persistence of newly formed cooperative entities—can 


1,3,4 
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function as a germ line within a life cycle that facilitates the reproduc- 
tion of collectives. Moreover, the two-phase life cycle presents selection 
with an altogether new kind of biological entity: each state becomes a 
different attribute ofa single organism whose evolution is unified through 
a developmental programme”. When reproduction of collectives is via 
fragmentation (a single-phase life cycle), the traits that yield success at 
the higher level are largely those that determine success of single cells. 
This offers limited opportunity for the emergence of new kinds of bio- 
logical individuality because properties of higher and lower levels remain 
aligned’>”*. 

Direct observation of early stages in an evolutionary transition requires 
that issues surrounding levels of selection be considered**"”. This nec- 
essarily leads to territory in which a range of perspectives is possible (see 
Supplementary Discussion). Our experimental design incorporates an 
ecology that is explicitly multi-level: both individual cells (that repro- 
duce once every hour), and individual lineages (that reproduce once 
every 9 days) can be units of selection; however, selection operates on 
cells and lineages over different timescales. While selection on indivi- 
dual cells favours short-term success, short-term success is unlikely to 
facilitate persistence of lineages. Indeed, persistence requires more than 
simply switching between phenotypes: it involves a developmental pro- 
gramme that underpins expression of a collective phase in which a soma- 
like body is constructed from germ-like cells. Cells of the body must 
simultaneously play an ecological role (maintaining the body near oxy- 
gen via a robust mat phenotype) while producing the seeds of the next 
generation of bodies (the germ-like cells). Given sufficient variation 
among lineages, then selection over the longer timescale stands to con- 
quer the short-term interests of individual cells. This appears to have 
happened in our CE regime with decoupling of fitness between levels 
supporting the view that selection has begun the process of transition- 
ing to the higher (collective) level—with the lower level beginning to 
function for the good of the collective. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Strains and medium. Pseudomonas fluorescens SBW25 (ref. 46) was grown at 
28 °C in 25-ml static glass microcosms containing 6 ml of King’s Medium B (with 
loose caps), and on King’s Medium B agar plates for 48 h. For competitive assays, 
strains marked with lacZ (ref. 47) allowed types to be distinguished by plating on 
media containing 60 1g ml” ' X-gal. 

CE regime. The CE regime (Fig. 1a, left panel) involved a two-phase life cycle (Fig. 1b). 
The experiment was founded by a single ancestral SBW25 genotype that initiated 
phase II. After 3 days of static incubation (phase II), 1.5 X 10’ ml (50 pl of a 
3.35 X 10°-fold dilution) was plated and a single WS colony of the most abundant 
morphotype was picked from each plate to inoculate a fresh microcosm (Fig. 1b). 
This marked the start of phase I of the first generation. Following 6 days of static 
incubation (phase I), microcosms were visually inspected to check for the presence 
ofan intact mat. Lines that had no mat at day six were deemed extinct. Microcosms 
containing lines with intact mats were vortex mixed and 2.5 X 10° ml (200 pl ofa 
8.0 X 10°-fold dilution) plated on solid media. After 48h incubation plates were 
inspected. To avoid extinction and successfully passage to phase II it was necessary 
for lines to have produced SM types. All SM colonies were transferred to 200 pl 
liquid medium and incubated for 24h under static conditions. The pooled set (from 
each microcosm) were mixed and 6 ll was used to inoculate phase II microcosms. 
CP regime. The CP regime (Fig. 1a, right panel) was identical to the CE regime 
except that during the life cycle cheats were purged and WS founded both phases 
of the cycle. Both regimes were conducted in parallel. 

Between-lineage selection. Each treatment consisted of 15 replicates of 8 com- 
peting microcosms, and replicates from both treatments were spread evenly across 
4 experimental blocks. All microcosms in the CE regime were subjected to the selec- 
tion regime outlined above; however, following assessment of mat integrity at the 
end of phase I, surviving lineages within each replicate were harvested by vortex- 
mixing and dilutions spread on agar plates (Fig. 2a). Extinct lineages (due to mat 
collapse/no mat/no SM colony post phase I, or no WS colony post phase II) were 
immediately replaced by randomly chosen surviving lines taken from the same 
replicate. On rare occasions all eight lines of a replicate were eliminated: in these 
instances one line, chosen at random, from the same experimental block, was used 
to re-found the replicate. The CP treatment was carried out as above, differing only 
following phase I when WS colonies were picked instead of SM colonies (Fig. 4a). 
A smaller volume (6.25 X 10°? ml (50 ull of a 8.0 X 10°-fold dilution)) was plated 
due to the absence of a phenotype-switching requirement for the CP treatment. 
Both treatment regimes were implemented for ten generations. 

Selection of representative WS genotypes. One single WS genotype to represent 
each replicate from the derived CE and CP ‘between-lineage selection regimes’ was 
generated after ten generations as described in Extended Data Fig. 1. The represen- 
tative WS colonies were grown in shaken microcosms (16h) and stored at —80 °C 
for post-selection analyses. This yielded 15 such types each for the ancestral and CP 
regimes, and 14 types for the CE regime (bacteria from one CE line could not be 
revived from the freezer stock). 

A lacZ-marked reference strain (W1-lacZ) for fitness comparisons in the CP 
regime (Extended Data Fig. 1b) was generated the same way from 3-day static 
microcosms inoculated with a lacZ-marked ancestral strain SBW25-lacZ (ref. 47). 
Fitness assay. Lineage- and cell-level fitness was assessed for all 44 representative 
ancestral and derived types (Extended Data Fig. 1b). The representative types were 
not directly competed against each other but against a single, neutrally marked 
reference strain assayed under the appropriate regime: the ancestral and derived 
CE types were competed against SBW25-lacZ, and the ancestral and derived CP 
types were competed against W1-lacZ. This yielded independent fitness values for 
each type (that is, potential interactions between ancestral and derived types didn’t 
affect the result), and allowed for estimates and comparison of relative perform- 
ance for all types. 

One lineage generation was performed for each of three replicate fitness assays 
for all 44 representative types (as shown in Extended Data Fig. 1b for one CE and one 
CP replicate). For each representative type, eight microcosms were each inoculated 
with one WS colony in phase I. After mat assessment surviving microcosms were 
pooled, and 2 x 10 7 ml (200 pil of a 1.0 X 10°-fold dilution) (CE) or 5 X 10° * ml 
(50 ull of a 1 X 10°-fold dilution) (CP) of this mixture plated. The total number of 
colonies (in mats) was recorded as a measure of cell fitness. All SM (CE) and WS 
(CP) colonies, and colonies of the competitor strains SBW25-lacZ and W1-lacZ 
were grown overnight and subsequently pooled as described above (CE regime). 
The derived and ancestral types were mixed with the competitor strains in pro- 
portion to their performance during phase I (details displayed in Extended Data 
Fig. 1b). Six microlitres of this mixture was used to inoculate the eight phase II 
microcosms. After three days of static incubation (phase II), microcosms were plated. 
The most abundant WS colony morphotype on each plate determined whether the 
ancestral/derived representative type or the marked reference type was more success- 
ful. The fitness of types representative of each line was calculated for each replicate 


of eight microcosms as the number of successful offspring of this type as a pro- 
portion of the total number of potential offspring (Extended Data Fig. 1b). 
Life-history analysis. Static microcosms (36 per representative genotype) were 
individually inoculated with single colonies of the representative WS types (1,584 
microcosms in total). Each day, three replicates were destructively harvested, plated, 
and the number of SM and WS colony forming units per microcosm was recorded. 
phase I was extended from 6 to 12 days, phase II from 3 to 6 days. At day six, prop- 
agules were collected for phase II, and microcosms inoculated (18 per type). Each 
day, three replicate microcosms per representative type were destructively harvested 
and number of SM and WS colony forming units recorded. 

SM growth assay. Three biological replicate SM colonies were derived from each 
of the representative ancestral and derived CE and CP WS types. Four day static 
microcosms seeded from a single WS colony were destructively harvested and plated 
(5X 10° ml). SM colonies were chosen, grown in shaken microcosms (16h), and 
stored at —80 °C. This procedure was repeated until three biological SM replicate 
colonies were obtained. For the types where three biological replicate SM colonies 
couldn’t be derived, additional experimental replicates of SM growth rate were 
assessed. In a small number of instances no SM types were obtained. 

SM growth kinetics were determined in 96-well microtitre plates shaken at 
28 °C, and absorbance (OD¢99) measured in a microplate reader (BioTek). Each 
well was inoculated with approximately 10* SM cells in 180 il King’s medium B 
and absorbance measured every 10 min for 24h. The growth of each biological 
replicate was determined in three different well locations on independent 96-well 
plates and on separate days. The maximum growth rate (Vinax) Was calculated from 
the maximum slope of the absorbance over time. The mean Vinax for each repres- 
entative type was calculated from all biological and experimental SM replicates. 
Statistical analysis. For detecting differences in line level fitness between the ances- 
tral and derived regimes, a generalized linear model (error structure: binomial; link 
function: logit) with the explanatory variables regime, and representative type (nested 
within regime) was calculated. Contrasts revealed differences in lineage level fit- 
ness between regimes. 

Analysis of variance (ANOVA) was used to test for differences in total number 
of cells, number of WS per pil, number of SM per ul (if present), and SM growth 
rate between the different regimes. Explanatory variables were regime, and rep- 
resentative type (nested within regime). Post hoc contrasts revealed differences 
between the ancestral and their respective derived regime. 

Generalized linear models (error structure: binomial; link function: logit) were 
used to test for the difference between regimes in life-history parameters during the 
course of the experiment. The response variable was ‘proportion of microcosms 
with the new type’. Explanatory variables were regime, representative type (nested 
within regime), and time. Analysis of variance was performed with the same explan- 
atory variables but for ‘new cell type per pl’, ‘total cells per pl’, and ‘proportion of the 
new cell type within a microcosny’. All three variables were Box-Cox transformed. 
Contrasts revealed differences between the regimes on the individual days. 

Relationships between all parameters were tested using the mean per repres- 
entative type accounting for regime. Pearson and Spearman rank correlations, 
and regressions (line level fitness: generalized linear models with the normal error 
structure, and the identity link function; cell-level fitness and number of SM per 
ul (if present): general linear models) were performed. SM growth rate, number of 
SM per ul (if present), and SM/WS occurrence were Box—Cox transformed. 

Colony counts from contaminated plates were excluded from analyses. Sample 
size was chosen to maximise statistical power and ensure sufficient replication. 
Assumptions of the tests, that is, normality and equal distribution of variances, were 
visually evaluated. Non-significant interactions were removed from the models. 
All tests were two-tailed. Effects were considered significant at the level of P< 0.05. 
All statistical analyses were performed with JMP 9. Graphs were produced with 
GraphPad Prism 5.0, JColorgrid (http://jcolorgrid.sourceforge.net), Adobe Illustrator 
CC 17.0.0 and Inkscape 0.48.2. 

Genetic manipulation. Standard genetic techniques were used to revert mutS 
(A1489C) to wild type. This involved PCR amplification of the wild-type sequence 
from SBW25 and its integration into the genome of WS,, by homologous recom- 
bination facilitated by a two-step allelic replacement strategy using pUIC-3 (ref. 27). 
Characterization of G-tract expansion and contraction in wspR was performed by 
PCR and Sanger sequencing. 

Genome sequencing. Genomic DNA was extracted from overnight cultures (each 
founded by a single colony) using Wizard Genomic DNA Purification Kit (Promega), 
and sent to the Australian Genome Research Facility and Pacific Biosciences, for 
Illumina and SMRT sequencing, respectively. Libraries for SMRT sequencing were 
prepared on a Sciclone NGS automated liquid handling workstation (Perkin Elmer, 
Waltham MA) using a 10-kb automated library preparation protocol (Pacific Bio- 
sciences, Menlo Park, CA) which includes an initial Solid Phase Reversible Immo- 
bilization (SPRI) clean-up step, followed by the standard 10-kb library preparation 
protocol. Libraries were annealed using 20X excess primer using the standard annealing 
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protocol (Pacific Biosciences). Sequencing was performed on the PacBio RSII 
using P4-C2 sequencing chemistry, magnetic bead loading, and 3-h movie acqui- 
sitions. Sequence reads were mapped against the P. fluorescens SBW25 GenBank 
reference and variants were called using SMRTPortal version 2.3. Additional ana- 
lyses were performed with Geneious 7.1.4. 
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Extended Data Figure 1 | Representative WS genotypes and fitness assays. 
a, To analyse the response of derived lines to selection, a single representative 
WS genotype was obtained from each replicate population (set of eight 
microcosms) from both ancestral and derived lines. To obtain the 
representative set of derived types under the CE regime, SM colonies were 
collected from the end of phase I at generation 10 and pooled. The pooled 
sample was used to found phase II, at the end of which a single WS-type 
representative of each replicate was selected as described for the baseline (see 
below). This yielded 14 such types, one representing each replicate. To obtain 
the set of baseline types representing the ancestral state, SBW25 was used to 
found phase II. At the end of the 3-day period lines were harvested and plated. 
The single most abundant WS type from the most densely populated plate was 
selected as representative of that replicate. A third set of representative WS 


genotypes was obtained from lines evolved under the CP regime. This was as for 
the CE regime, but instead of pooling SM at the end of phase I, WS were 
collected and pooled. A single dominant WS type was chosen from each 
replicate. b, Cell- and line-fitness assays. Lines founded by representative WS 
genotypes (from ancestral and derived lineages) were competed against a 
marked (blue colonies) reference strain (SM and WS, for the CE and CP 
regimes, respectively). Use of the marked reference strain allowed the 
competitive performance of all ancestral and derived types to be assessed 
against a single common genotype. Cell fitness is the total number of cells in the 
mat after phase I, whereas line fitness is the proportion of evolved ‘offspring’ 
mats relative to a marked reference strain. In total 2,472 microcosms were 
assayed (three replicate assays per line). 
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Extended Data Figure 2 | Line fitness and life cycle perpetuation. 


a, b, Fitness of ancestral (a) and derived (b) lines and relationship with capacity 
to perpetuate the two-phase life cycle. Data are a breakdown of data in Fig. 3a. 


Colour intensity represents the proportion of three replicate microcosms 


0123456 
Day 


harbouring the new type (white, absence of new type; dark red (yellow), 
presence of SM (WS) in all microcosms). Lines are ordered according to their 
fitness as assayed under the CE regime. Grey cells indicate loss of lines due 
to extinction. 
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Extended Data Figure 3 | Life history traits under the CE regime. a, Cell failed to produce the required type were excluded. Black, derived; grey, 
density of the new type. b, Proportion of the new cell type divided by the total ancestral. Error bars are s.e.m., based on n = 15. *P < 0.05, using analysis 
number of cells. Each circle represents the mean of 42-45 lines (that is, of variance (ANOVA) and post hoc contrasts. 

three replicates for each of the 15 ancestral and 14 derived lines). Lines that 
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Extended Data Figure 4| Growth rate of SM. Growth rate of the ancestral (ANC) and derived SM types from the CE and CP regimes obtained from the 
representative genotypes (biological and technical replicates; see Methods). (ANC, n = 81; CE, n = 95; CP n = 81). Error bars are s.e.m., based on n = 15. 
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Extended Data Figure 5 | Relationship between fitness and associated traits. 
a-d, Summary of parameters describing line and cellular properties and the 
relationship among these parameters in ancestral (a) and derived (b) CE and 
ancestral (c) and derived (d) CP populations. Traits in the evolved populations 
(b, d) are depicted relative to their respective ancestral states (a, c): significant 
increase (large red circle), significant decrease (small blue circle), and no 
significant change (grey circle) using a generalized linear model (error 
structure: binomial; link function: logit) for line fitness and WS—SM with 
post hoc contrasts; and analysis of variance (ANOVA) for number of cells, 
number of WS, number of SM, and SM growth rate with post hoc contrasts. 
WS-—SM, proportion of lines producing SM during phase I. Arrows indicate 
significant regressions and lines indicate significant correlations between traits, 
dashed lines indicate trends (0.05 < P< 0.09). The colour represents the 
direction of the relationship: red, positive; blue, negative. The significance level 
is P< 0.05 using Pearson and Spearman rank correlations, and regressions 
(line level fitness: generalized linear models; cell-level fitness and number of SM 
per tl (if present): general linear models). Individual cell properties displayed in 
a and c are identical for the ancestral state in both CE and CP regimes, but 
measures of line fitness are regime-specific, and transform the associations 


between parameters. Parameters that relate positively to line fitness in the CE 
regime negatively affect line fitness in the CP regime, and vice versa (a versus 
c). For example, in the CE regime, the number of SM cells and the rate at 
which WS cells give rise to SM cells positively regresses on line fitness (red 
arrows, a), whereas only the number of WS cells shows a positive regression 
with line fitness in the CP regime (red arrows, c). The relationships between 
parameters in the ancestral populations predict their evolutionary trajectory in 
each regime. After 10 generations of line selection the relationships between 
cell- and line-level parameters significantly altered in both CE and CP regimes 
(b and d). Line fitness improved in both regimes, thereby imposing selection on 
parameters that were linked to line fitness in their respective baselines. In the 
CE regime enhanced line fitness is explained by a significant increase in the 
capacity to transition from WS to SM and is not explained by enhanced 
performance of single cells: the fitness of single cells either remained unaltered 
or declined. Increased line fitness can be seen as a product of selection at the 
higher (group) level. In marked contrast is the CP regime where improved line 
fitness is readily explained by changes in traits that improve the competitive 
ability of individual cells. Enhanced line fitness in the CP regime can be 
interpreted as a by-product of selection at the lower (cell) level. 
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Extended Data Figure 6 | Cycling through phases. Ancestral and derived 
lines differ in their capacity to transition between phases of the life cycle. Three 
replicate populations of the two ancestral and derived lineages with the highest 
fitness: 71, 73 and 16, 17, respectively, were founded by the representative WS 
type and plated to check for SM types. Whereas ancestral WS took 48h to 
generate detectable levels of SM, the two derived WS populations contained a 
mixture of WS and SM colonies, such that even at the time of initial inoculation, 
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both types were present. SM colonies were then used to found populations 
that were plated to check for WS types. Ancestral lineages 71 and 73 completed 
two cycles before extinction through failure to produce SM, whereas derived 
lines 16 and 17 completed five cycles before termination of the experiment. 
Colonies were photographed after 48 h of growth. Notable in the evolved lines 
is the visible presence of zones of SM cells surrounding the central WS colony. 
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Extended Data Figure 7 | Mechanism of life cycle transition. a, Outline of 
the evolutionary history of line 17 from its founding by ancestral SM, SBW25 
through ten generations of the life cycle, by which time a mutS mutation had 
arisen. At the eleventh generation, the mutS (A1489C) mutation in WS. was 
reverted to wild type (mutS”") by in vitro manipulation. The two WS), lineages 
(with and without the mutS mutation) were taken through three additional 
‘expedited’ life-cycle generations, although with a cycling period of 24h for 
mutS (A1489C) and 48h for mutS“. Genome sequences were acquired from 
five different time points (SM, WSg, WS22, SM27 and WSog, indicated by 
surrounding boxes). For details of mutations see Supplementary Table 1. b, The 
mutS-dependent genetic switch. Four independent cultures of the generation- 
11 WS (WS,,), with and without the mutS (A1489C) mutation, were passaged 
through an additional three ‘expedited’ generations of the life cycle 


(WS2.-WSyg). The nucleotide sequence of wspR was determined at each 
stage. Depicted is the tract of guanine residues beginning at nucleotide 742: 
WspR is active (and the phenotype is WS) when the tract length is seven 
residues, but inactive (and the phenotype is SM) when the tract length is 
eight. Red arrows show slippage events resulting in the gain or loss of a single 
guanine residue. Transitioning between phases is more reliable (fewer 
extinction events) and more likely to occur via the tract of guanine residues 
in the presence of mutS (A1489C). Lines of mutS (A1489C) were passaged on 
a 24-h cycle; mutS” lines were passaged on a 48-h cycle (on a 24-h cycle 
mutS lines were extinct before the first generation completed—ancestral 
lines are incapable of transitioning on a 48-h cycle). Extinction events occurred 
whenever the extant phase failed to produce the next stage in the life cycle; 
death of a line allowed birth of an extant lineage (diagonal arrows). 
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Extended Data Figure 8 | Life history traits under the CP regime. 


a, Proportion of lines producing SM. b, Total cell density. c, Cell density of 
the new type. d, Proportion of SM cell types divided by the total number of cells. 
Each square represents the mean of 45 lines (that is, 3 replicates for each of 
the 15 lines); however, for cand d lines that failed to produce SM were excluded. 
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Black, derived; grey, ancestral. Error bars are s.e.m., based onn = 15.*P < 0.05, 
using a generalized linear model (error structure: binomial; link function: 
logit) and post hoc contrasts for a; and using analysis of variance (ANOVA) and 


post 


hoc contrasts for b-d. 
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Extended Data Table 1 | Differences in life history parameters 


a 
Parameters N df F/X’ P R 
Line fitness Ma 
Full model 1392 59 1160.211 <0.0001 
Regime 3 61.429 <0.0001 
Rep. type (Regime) 56 1090.351 <0.0001 
Cell fitness F 
Full model 126 43 4.404 <0.0001 0.698 
Regime 2 13.524 <0.0001 
Rep. type (Regime) 41 3.916 <0.0001 
#WS F 
Full model 126 43 5.508 <0.0001 0.743 
Regime Pa 28.282 <0.0001 
Rep. type (Regime) 41 4.339 <0.0001 
#SM X 80 Fe 
Full model 31 5.148 <0.0001 0.769 
Regime 2 2.358 =0.1055 
Rep. type (Regime) 29 5.190 <0.0001 
SM growth rate Fr 
Full model 104 36 6.450 <0.0001 0.777 
Regime 2 32.806 <0.0001 
Rep. type (Regime) 34 5.101 <0.0001 
ws > SM x 
Full model 1557 43 565.727 <0.0001 
Regime 2 289.108 <0.0001 
Rep. type (Regime) 41 467.060 <0.0001 
b 
Parameters N df F/ P 
Proportion of lines with ¥ 
new type 
Full model 1357 64 690.216 <0.0001 
Regime 1 5.442 =0.0197 
Rep. type (Regime) 27 ~=©457.619 <0.0001 
Time 18 385.403 <0.0001 
Regime x Time 18 62.994 <0.0001 
Total # cells/yl X Fr 
Full model 1355 46 21.697 <0.0001 0.433 
Regime 1 51,521 <0.0001 
Rep. type (Regime) 27 8.144 <0.0001 
Time 18 39.887 <0.0001 
Regime x Time ns. 
# new type/pl X F 
Full model 740 46 8.823 <0.0001 0.369 
Regime 1 0.589 =0.4431 
Rep. type (Regime) 27 5.620 <0.0001 
Time 18 13.615 <0.0001 
Regime x Time ns. 
Proportion new cell type F 
Full model 740 46 6.447 <0.0001 0.300 
Regime i | 1.701 =0.1926 
Rep. type (Regime) 27 6.648 <0.0001 
Time 18 6.229 <0.0001 
Regime x Time ns. 


a, Differences in line fitness, cell fitness and life-history traits between ANC, CE and CP regimes. Results from generalized and general linear models are shown. Values for ancestral lines are the same for all 
parameters measured during phase I, whereas line fitness differs between ancestral CE and CP regimes. Rep. type, representative WS genotype; #WS, number of WS at day 6 within a mat, #SM X, number of SM at 
day 6 within a mat (SM = O were excluded); X, Box—Cox transformation; WS-—SM, proportion of lines producing SM during phase |. b, Differences in life history traits between ancestral and evolved CE regimes over 
time. Results from generalized and general linear models are shown (all three parameters were Box—Cox transformed). Bold denotes significance at the level of P< 0.05. 
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Extended Data Table 2 | Relationship between fitness and life history parameters 
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a 
SM growth X #SM/l X #WS WS 3SM X Line fitness Cell fitness 
- R°=0.449, r=-0.372, r=0.222, ¥°=0.711, r=-0.218, 
F=4.893, P=0.2335, P=0.4882, df=1, P=0.4964, 
SM growth X P=0.0690, N=12 N=12 P=0.3992, N=12 
N=8 N=12 
R*=0.007, - rs=-0.667, rs=0.619, x7=26.801, R*=0.380 
F=0.087, P=0.0710, P=0.1017, df=1, F1,9=3.671, 

#SM/ul X P=0.773, N=8 N=8 P<0.0001, P=0.1038 
N=14 N=8 
r=0.381, rs =0.007 - r=-0.689, x7=11.150, R’=0.339, 
P=0.1796, P=0.9822, P=0.0045, df=1, F,,15=6.673, 

#WS N=14 N=14 N=15 P=0.0008, P=0.0227 

N=15 
r=0.063, rs=0.587, r=0.114, - x7=22.801, r=-0.378, 
P=0.8297, P=0.0274, P=0.6987, df=1, P=0.1645, 
WS SM X N=14 N=14 N=14 P<0.0001, N=15 
N=15 
¥°=0.061, x7=9.305, ¥7=1.008, 47=12.324, - 7=4.246, 
: df=1, df=1, df=1, df=1, df=1, 

Line fitness P=0.8046, P=0.0023, P=0.3154, P=0.0004, P=0.0393, 
N=14 N=14 N=14 N=14 N=15 
r=0.486, R°=0.009, R’=0.890, r=0.326, x°=2.041, - 
P=0.0783, Fy,14=0.113, F4,14=97.359, P=0.2558, df=1, 

Cell fitness N=14 P=0.7428 P<0.0001 N=14 P=0.1531, 

N=14 
b 
SM growth X #SM/pl X #WS WS3SM X Line fitness Cell fitness 
: R*=0.449, r=-0.372, r=0.222, ¥°=0.705, r=-0.218, 
F=4.893, P=0.2335, P=0.4882, df=1, P=0.4964, 
aM growin P=0.0690, N=12 N=12 P=0.4010, N=12 
N=8 N=12 
R’=0.011, - rs=-0.667, rs=0.619, x7=17.568, R*=0.380 
F=0.0656, P=0.0710, P=0.1017, df=1, F1,9=3.671, 

#SMiul X P=0.8064, N=8 N=8 P<0.0001, P=0.1038 
N=8 N=8 
r=0.215, rs=0.222, - r=-0.689, x7=6.153, R*=0.339, 
P=0.5265, P=0.5372, P=0.0045, df=1, F,15=6.673, 

#WS N=11 N=10 N=15 P=0.0131, P=0.0227 

N=15 
r=0.221, rs=0.616, r=-0.669, - x7=8.710, r=-0.378, 
P=0.4129, P=0.0580, P=0.0064, df=1, P=0.1645, 
WS SM X N=11 N=10 N=15 P=0.0032, N=15 
N=15 
7=5.567, ¥7=0.312, 47=7.552, 47=2.129, - ¥°=0.746, 
; df=1, df=1, df=1, df=1, df=1, 

Line fitness P=0.0183, P=0.5762, P=0.0060, P=0.1445, P=0.3878, 
N=11 N=10 N=15 N=15 N=15 
r=0.290, R°=0.074, R’=0.887, r=-0.473, x7=5.339, - 

P=0.3874, F4,10=0.640, F,15=101.583, P=0.0750, df=1, 
Cell fitness N=11 P=0.4470 P<0.0001 N=15 P=0.0208, 
N=15 


a, CEregime, and b, CP regime. X denotes parameters that were Box-Cox transformed to meet requirements of normality and equal variance. WS—SM, proportion of lines producing SM during phase I. Above the 


diagonal are tests for the ancestral regimes (grey), below for the evolved regimes (black). Bold denotes significance at the level of P< 0.05. 
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Architecture of mammalian respiratory 


complex I 


Kutti R. Vinothkumar'*, Jiapeng Zhu?* & Judy Hirst? 


Complex I (NADH:ubiquinone oxidoreductase) is essential for oxidative phosphorylation in mammalian mitochondria. It 
couples electron transfer from NADH to ubiquinone with proton translocation across the energy-transducing inner 
membrane, providing electrons for respiration and driving ATP synthesis. Mammalian complex I contains 44 different 
nuclear- and mitochondrial-encoded subunits, with a combined mass of 1 MDa. The 14 conserved ‘core’ subunits have 
been structurally defined in the minimal, bacterial complex, but the structures and arrangement of the 30 ‘supernumerary’ 

subunits are unknown. Here we describe a 5 A resolution structure of complex I from Bos taurus heart mitochondria, a 
close relative of the human enzyme, determined by single-particle electron cryo-microscopy. We present the structures 
of the mammalian core subunits that contain eight iron-sulphur clusters and 60 transmembrane helices, identify 18 super- 
numerary transmembrane helices, and assign and model 14 supernumerary subunits. Thus, we considerably advance 
knowledge of the structure of mammalian complex I and the architecture of its supernumerary ensemble around the core 
domains. Our structure provides insights into the roles of the supernumerary subunits in regulation, assembly and homeo- 


stasis, and a basis for understanding the effects of mutations that cause a diverse range of human diseases. 


Mammalian complex I (ref. 1) is one of the largest and most complicated 
enzymes in the cell. Complex I from B. taurus (bovine) heart mitochon- 
dria has been characterized extensively as a model for the human enzyme; 
both enzymes contain 44 different subunits (encoded by both the nu- 
clear and mitochondrial genomes)** and nine redox cofactors (a flavin 
mononucleotide and eight iron-sulphur clusters). Fourteen subunits 
are the core subunits that are conserved in all complex I enzymes; they 
contain all the mechanistically critical cofactors and structural elements 
and are sufficient for catalysis. Crystal structures of intact complex I 
from the thermophilic bacterium Thermus thermophilus’, and of domains 
of the prokaryotic enzymes from T. thermophilus and Escherichia coli?” 
have provided a wealth of information on the structures of these sub- 
units—but they represent only half the mass of the mammalian enzyme. 
The cohort of 30 supernumerary subunits particular to the mammalian 
enzyme” has been accumulated through evolution. The supernumer- 
ary subunits may have alternative functions or be important for assem- 
bly, regulation, stability or protection against oxidative stress—their 
structures and arrangement around the core subunits are not known. 

Owing to its size, L-shaped asymmetry, membrane-bound location, 
and multi-component structure, mammalian complex I has proved dif- 
ficult to crystallize, and its high-resolution structure has not yet been 
determined. Crystallographic information on any eukaryotic complex I 
is currently limited to a medium-resolution map of the enzyme from the 
yeast Yarrowia lipolytica, which has been described, but not modelled’. 
Conversely, the size and shape of complex I make it an attractive target 
for electron microscopy (EM), and the enzymes from several species 
have been visualized to display their overall L-shaped structures”, al- 
though at too lowa resolution to reveal detailed structural information. 
A high-resolution structure of the mammalian enzyme is essential for 
understanding how the 30 supernumerary mammalian subunits are 
arranged around the core domain, how they determine the properties, 
assembly and activity of the enzyme, and how mutations in both the core 
and supernumerary subunits cause human diseases’. 


Imaging and reconstruction 

Complex I was purified from B. taurus heart mitochondria in detergent’, 
and imaged in vitreous ice on holey-carbon grids with a Falcon direct 
electron detector (see Methods). The enzyme adopts different orien- 
tations on the grid, and reference-free two-dimensional class averages 
clearly show the characteristic L-shape of the minimal prokaryotic form 
augmented by extra domains from the supernumerary subunits (Ex- 
tended Data Fig. 1). Refinement was performed in RELION™ and movie 
frames were used to correct for beam-induced movement’. Per-frame re- 
construction and B-factor weighting were followed by three-dimensional 
classification, resulting in the final map (Fig. 1) obtained from 25,492 
particles with an overall resolution of ~5 A (see Methods and Extended 
Data Fig. 2). Viewed at a low-density threshold the map is dominated 
bya disordered detergent-phospholipid belt that encircles the hydropho- 
bic domain and defines the position of the membrane. At intermediate- 
density threshold, the hydrophilic matrix domain and the extended 
membrane domain, containing a large number of transmembrane «helices 
(TMHs), are observed. The highest-density peaks in the map reveal the 
eight iron-sulphur (FeS) clusters that, as in the T. thermophilus*’ and 
Y. lipolytica® enzymes, form a chain through the hydrophilic domain. 


Structures of the core subunits 


The 14 conserved core subunits of complex I (refs 1, 4) catalyse the en- 
ergy transducing reactions: NADH oxidation, ubiquinone reduction and 
proton translocation (Extended Data Table 1 summarizes their nomen- 
clature). The seven nuclear-encoded hydrophilic core subunits harbour 
a flavin mononucleotide to oxidize NADH, FeS clusters for inter- 
substrate electron transfer, and the ubiquinone-binding site. The seven 
mitochondrial-encoded membrane core subunits contain four antiporter- 
like domains for proton translocation. The structures of the mammalian 
core subunits (Fig. 2) were fitted to the density map (see Methods) using 
the structure of T. thermophilus complex I (ref. 4), secondary structure 
analyses and sequence alignments, and using structural features and 
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Figure 1 | Overall map for complex I from B. taurus heart mitochondria 
determined by single-particle cryo-EM. Three distinct features of the 
complex are revealed by overlaying maps at different density thresholds. The 
map at the highest threshold (red) reveals the FeS clusters. The map at 
medium threshold (grey) reveals the overall architecture of the protein and the 
78 TMHs in the membrane domain. The detergent-phospholipid belt observed 
as a dominant feature at low density threshold (translucent blue) represents 
the density that remains around the membrane domain after cutting out the 
final model of the protein, and denotes the position of the complex in the 
membrane. It is ~30 A thick, and 3-4 A thinner at the proximal end of the 
complex (left) than at the distal end (right). 


densities from aromatic side chains (Extended Data Fig. 3). Except for 
the FeS-cluster ligands they have been modelled as polyalanine chains, 
with the residue numbering optimized to enable individual residues to 
be located (Extended Data Table 2). It is not possible to attribute den- 
sity to any bound ubiquinone species in the present map. 

A comparison of the bacterial* and mammalian core enzymes reveals 
that the mammalian membrane domain is more strongly curved ‘out’ 
of the membrane plane (Extended Data Fig. 4). However, within each 
individual subunit the 60 TMHs of the mammalian core subunits match 
their T. thermophilus counterparts closely (Extended Data Fig. 5)—only 
the position of TMH4 in subunit ND6 is different, and the extra carboxy- 
terminal TMH particular to T. thermophilus ND1 is absent from B. 
taurus (Fig. 2 and Extended Data Fig. 5). No notable density is observed 
in place of the three amino-terminal TMHs (present in T. thermophilus 
and Y. lipolytica) that have been lost through evolution of mammalian 
ND2 (ref. 16), so they have not been substituted structurally by other 
subunits. Importantly, catalytically relevant features identified in the 
antiporter-like subunits of the bacterial complex*® are conserved. They 
include the loops in the six broken TMHs in ND2, ND4 and ND5 (see 
Extended Data Fig. 3 for examples) that may constitute part of the proton- 
translocation mechanism, and the long transverse helix in NDS, a pro- 
posed coupling element. 

Of the seven hydrophilic core subunits (Fig. 2), the structures of the 
B. taurus 51 kDa subunit (human homologue NDUFV1), 49 kDa (NDUFS2), 
24 kDa (NDUFV2), PSST (NDUFS7) and TYKY (NDUEFSS8) subunits, 
and the small domain of the 75 kDa subunit (NDUFS1) are closely con- 
served from their T. thermophilus homologues*’ (Extended Data Fig. 5), 
with marked variation only in the length and extent of some of their N 
and C termini. Consequently, the arrangements of the FeS cluster chains 
are also very similar (Extended Data Table 3), except that, owing to ro- 
tation of the 51 and 24 kDa subunits, the superimposed chains diverge 
with increasing distance from the membrane (Extended Data Fig. 4). 
The sequence and structural conservation of the large domain of the 
75 kDa subunit, which contains an extra, catalytically redundant clus- 
ter in T. thermophilus’ and the 30 kDa subunit (NDUFS3), are lower 
(Extended Data Table 2). As neither of them have any known catalytic 
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Figure 2 | Structures of the core subunits of mammalian complex I. 

a, Structural models of the 14 mammalian core subunits (cartoon 
representation) and their density (transparent surface); the subunits are 
coloured individually and labelled with text in the same colours. The chain 
of FeS clusters is shown modelled to the highest density peaks (blue mesh) in 
the inset. b, The seven membrane-bound mammalian core subunits, viewed 
from the matrix. Arrows indicate the positions of the four TMHs in T. 
thermophilus that are not present in B. taurus: three N-terminal TMHs in 
ND2 and one C-terminal TMH in ND1. The position of TMH4 in ND6 is 
different in B. taurus and T. thermophilus (marked with stars). For a detailed 
comparison of the B. taurus and T. thermophilus structures see Extended 
Data Figs 4 and 5. 


role, we conclude that the catalytically critical subunits and cofactors 
are closely conserved in the mitochondrial and bacterial enzymes, sup- 
porting their common mechanism of catalysis. 


The supernumerary ensemble 


Once the core subunits had been modelled, the map revealed that addi- 
tional densities form an open cage around the core (Fig. 3). These den- 
sities are attributed to the supernumerary subunits, and they are arranged 
predominantly around the membrane domain and lower hydrophilic 
domain, where they may help to protect FeS-containing PSST and TYKY 
from oxidative damage. Conversely, the area around the NADH-binding 
site, where complex I produces superoxide”, is bare (Fig. 3), so super- 
numerary subunits do not shield it from O, to minimize superoxide 
production. The NADH dehydrogenase domain is added at the end of 
the complex I assembly pathway"’, and the local paucity of supernu- 
merary subunits may facilitate both its integration and its replacement 
(while retaining the rest of the protein) to mitigate the effects of oxi- 
dative damage’’. Two large supernumerary domains, one capping ND5 
and part of ND4, the other capping ND2, are observed on the matrix 
surface of the membrane domain. Facing the intermembrane space, as 
noted in Y. lipolytica®, the supernumerary subunits form a layer of pro- 
tein that may have a role similar to that of the stabilizing B-hairpin- 
helix structures observed in the prokaryotic enzyme’. Eighteen super- 
numerary TMHs are distributed around the core membrane domain 
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(Figs 3 and 4), consistent with the predictions of sequence analyses for 
14 to 18 TMHs from these subunits (Extended Data Table 4). In total, 
therefore, we observe 78 TMHs in the mammalian enzyme. Two TMHs 
are on the outside of the ND5 transverse helix, appearing to strap it to 
the core domain, and four more are positioned close to the end of it, 
appearing as a restraint for its lateral movement (Fig. 3). These obser- 
vations raise the question of whether large-scale piston-like motions 
of this helix during catalysis, as postulated from the T. thermophilus 
structure’, are feasible. 


Assignment of 14 supernumerary subunits 

To identify and assign individual supernumerary subunits to the map 
for mammalian complex I (Extended Data Table 4 summarizes their 
nomenclature) we used biochemical, sequence and structural informa- 
tion. Homology models for six of the hydrophilic supernumerary sub- 
units were created using known structures (Extended Data Tables 4 and 5). 
Human B8 (NDUFA2) adopts a thioredoxin fold” and its structure 
(Fig. 5) was located at the tip of the large domain of the 75 kDa subunit 
(Fig. 4), so (contrary to current models’*) B8 is likely to be assembled 
into complex I after (or with) the 75 kDa subunit. B8 is extensively de- 
graded in brain mitochondria from patients with Parkinson’s disease”’, 
and, along with other NADH dehydrogenase domain subunits, it is rap- 
idly exchanged under steady-state conditions”’. Therefore, it may help 
to protect the core enzyme against oxidative damage. Similarly, regions 
of density consistent with two subunits important for complex I assembly’’, 
the 18 kDa (NDUFS4) and 13 kDa (NDUEFS6) subunits (Extended Data 
Tables 4 and 5), were located (Fig. 4). However, they are small proteins 
with no predicted dominant secondary structure and it cannot be ex- 
cluded that other supernumerary subunits have similar structures. In 
the current map, the 18 kDa subunit has been modelled into a density 
ina cleft between the 75 kDa subunit and the 49 kDa, 30 kDaand TYKY 
subunits; the density attributed to the 13 kDa subunit suggests that it 
interacts with the 75 kDa, 49 kDa and TYKY subunits (Fig. 4). These lo- 
cations may explain why clinically identified mutations in the 18 kDa and 
13 kDa subunits lead to accumulation of late-stage interrupted-assembly 
intermediates lacking the NADH-dehydrogenase module”. 

The 42 kDa subunit (NDUFA10), a member of the nucleoside ki- 
nase family”, was easily located as the density on top of ND2, on the 
matrix side of the membrane (Figs 4, 5 and Extended Data Tables 4, 5). 
Its location is neatly confirmed by its absence from the density map of 
Y. lipolytica complex I (ref. 8), which lacks this mammalian-specific 
subunit. Phosphorylation ofa serine in the 42 kDa subunit by a PINK1- 
dependent mechanism has been proposed to be required for complex I 
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Figure 3 | Architecture of 
mammalian complex I showing the 
densities of the supernumerary 
subunits enclosing the core 
domain. The models for the core 
subunits are in light colours (as 
labelled) in surface representation, 
and density attributed to the 
supernumerary subunits, forming a 
cage around the core subunits, is in 
dark red. The supernumerary 
subunits are concentrated on each 
side of the membrane domain, and 
around the lower section of the 
hydrophilic domain. The NADH- 
binding site in the 51 kDa subunit is 
indicated, with the predicted 
positions for the flavin isoalloxazine 
(orange spheres) and three conserved 
phenylalanines at the entry to the 
site (yellow); the vicinity of this site is 
devoid of supernumerary subunit 
density. 
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Figure 4 | Structural assignments of supernumerary subunits in 
mammalian complex I. a, A semi-transparent surface for the density map for 
mammalian complex I is shown in pale grey, with the surface from the core 
subunits in wheat. Structural models for the supernumerary subunits are shown 
in colour and labelled accordingly (dashed lines indicate subunits on the back 
of the structure). Subunits labelled with brackets are those with less certain 
assignments, and structural elements, which cannot be assigned confidently 
in the current map, are in blue. b, Arrangement of TMHs, viewed from the 
matrix. The core subunits are in light colours (wheat for ND1, ND4 and NDS, 
green for ND2, ND3, ND4L and ND6). The supernumerary subunits are 
coloured as in a. 
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Figure 5 | Structural models for supernumerary subunits in mammalian 
complex I. a, Models for three supernumerary subunits in cartoon 
representation, coloured from blue to red (N to C termini). b, Structural models 
and relationships of supernumerary subunits to the core structure. B14.7 is 
located at the end of the transverse helix, next to ND5-TMH16. The density 
assigned to PGIV forms an L-shaped ‘clip’ over B16.6, which bends around 
the heel at ND1. Finally, the supernumerary subunits around the lower section 
of the hydrophilic domain are viewed in cartoon representation from the 
matrix. The core membrane subunits (white) and four core hydrophilic 
subunits, the 49 kDa (blue), 30 kDa (green), PSST (yellow) and TYKY (cyan) 
subunits, are shown in surface representation. 


activity”*, implying both its regulatory role anda molecular link between 
PINK1 dysfunction and complex I activity in Parkinson’s disease. Fur- 
ther elucidation of this regulatory pathway must now be reconciled with 
the matrix location of the 42 kDa subunit. The 39 kDa subunit (NDUFA9), 
a member of the nucleotide-binding short-chain dehydrogenase/reductase 
family** (Extended Data Tables 4 and 5), was readily located adjacent 
to PSST (Figs 4 and 5), and observed to contain a density consistent 
with a bound dinucleotide (Extended Data Fig. 3)*”. Furthermore, it par- 
tially encloses the long ND3 loop (resolved only in T. thermophilus*) 
that is critical for coupling electron and proton transfer, and which, 
in a conformational transition now known to also involve the 39 kDa 
subunit”, switches the enzyme into a ‘deactive’ state during ischaemia. 
Notably, these proposed regulatory elements are all located close to the 
junction between the hydrophilic and membrane domains where the en- 
ergy from the redox reaction is used to initiate proton translocation’. 

Two regions of density corresponding to the structure of the SDAP 
subunit (NDUFAB1), which is identical to the acyl carrier protein in 
the mitochondrial matrix”*°, were identified in the mammalian en- 
zyme (Fig. 4). This result is supported by the presence of SDAP in both 
subcomplex Ia (which contains the hydrophilic domain) and subcom- 
plex If (the distal portion of the membrane domain) of B. taurus com- 
plex I (ref. 3), and with the presence of two SDAP homologues in Y. 
lipolytica complex I (ref. 31). One SDAP is located at the distal end of 
the enzyme, above ND5, the other in a peripheral region of the hydro- 
philic domain, in a subdomain that interacts with the 49 kDa and 30 kDa 
core subunits through a three-helix bundle (Figs 4 and 5). From a recent 
study in Y. lipolytica® these helices are assigned to subunit B14 (NDUFA6), 
a protein with an LYR motif that, when deleted in Y. lipolytica, results 
in loss of catalytic activity. Notably, subunit B22 (NDUFB39) also con- 
tains an LYR motif and it is in subcomplex If, so it is possible that the 
distal SDAP molecule interacts with it in a similar fashion. Finally, sub- 
units B13 (NDUFAS) and B14 have similar predicted secondary struc- 
tures so we ascribe the second three-helix bundle observed on the side 
of the 30 kDa subunit, adjacent to the 42 kDa subunit, to B13 (Figs 4, 5). 


ARTICLE 


Continuous density links the four supernumerary TMHs at the end 
of the transverse helix. Only one subunit, B14.7 (NDUFA11), is predicted 
to contain more than two TMHs (Extended Data Table 4); secondary 
structure analyses predict that the first two are unusually long (~30 
residues), so they probably correspond to the two highly tilted TMHs. 
Therefore, these four TMHs are assigned to subunit B14.7 (Figs 4, 5 
and Extended Data Table 5), a protein that is important for the assem- 
bly and/or stability of the membrane domain*’. A second cluster of three 
TMHs (opposite B14.7) may include two TMHs from B14.5b (NDUFC2), 
but the connectivity between them is ambiguous and a clear assign- 
ment cannot be made. The 11 remaining TMHs are spread around the 
membrane domain (Fig. 4). Three TMH-containing subunits, B16.6 
(NDUFA13), MWFE (NDUFA1) and B9 (NDUFA3), remain assoc- 
iated with the hydrophilic domain in subcomplex Io after fractiona- 
tion of B. taurus complex I with zwitterionic detergents’, and they are 
missing from Y. lipolytica subcomplex Id, which lacks core subunits 
NDI, 2,3 and 4L™. Therefore, they are assigned to the three TMH den- 
sities next to ND1 (Extended Data Table 4 and 5). Sequence analyses 
predict a single 67-residue helix in subunit B16.6, with the first 20 res- 
idues forming a TMH. Correspondingly, one of the three densities is 
very long and modelled as a single 63-residue helix that interacts with 
the N terminus of TYKY on the matrix side, spans the membrane, then 
bends into the intermembrane space and is anchored under the ‘heel’ 
(Figs 4 and 5). Therefore, this density is assigned to B16.6, a protein 
identical to the cell death regulatory gene product GRIM19 (ref. 35). It 
is currently not possible to confidently deduce assignments for the TMH- 
containing subunits of subcomplex If. 

The PGIV (NDUFA8), 15 kDa (NDUFSS) and B18 (NDUFB7) sub- 
units contain twin CX,C motifs that form two intramolecular disul- 
phides within “CHCH’ domains”, and they have been assigned to the 
intermembrane surface of complex I (ref. 37). A double L-shaped den- 
sity, resembling two CHCH domains at right angles, is clearly visible 
on the heel, clamping B16.6 (Figs 4 and 5) onto the core. PGIV, a sub- 
unit present in subcomplex Io (Extended Data Table 4), contains two 
CHCH domains, so it is assigned to the L-shaped density feature (Figs 4 
and 5), consistent with the position of an antibody label to its homol- 
ogous subunit in Y. lipolytica**. Our structure thus reveals the architec- 
ture of the ‘400 kDa’ assembly intermediate of human complex I (refs 18, 33) 
that contains the core hydrophilic subunits 49 kDa, 30 kDa, PSST and 
TYKY, core membrane subunit ND1, and the supernumerary subu- 
nits PGIV, B9, B16.6 and B13. 


Conclusions and perspectives 


We have described a 5 A resolution cryo-EM density map for mam- 
malian complex I, and used it to produce structural models for the 14 
core subunits that are conserved in all complex I enzymes, plus 14 of the 
supernumerary subunits of the mammalian enzyme. The core subunits 
comprise the catalytically active centre of the enzyme, and (as expected) 
they closely resemble their counterparts described by the atomic-resolution 
structure of bacterial complex I (ref. 4). The 14 supernumerary sub- 
units assigned include two copies of subunit SDAP, bringing the total 
number of subunits in the mammalian complex up to 45. We have used 
our structural models for the supernumerary subunits to support and 
discuss their roles in assembly, homeostasis and regulation. Higher- 
resolution maps are required for assignment of the remaining 17 super- 
numerary subunits. 

Recent developments in direct electron detectors, microscopy and 
image processing algorithms have enabled high-resolution structures 
of biological macromolecules to be determined by single-particle cryo- 
EM at resolutions that have previously only been routinely possible with 
X-ray crystallography***°. Thus, we believe that it will be possible to 
extend our current study to produce a high-resolution structure for com- 
plex I in the near future, to allow us to identify and model all the super- 
numerary subunits, and to characterize the structural changes that occur 
during catalysis, a crucial step in defining the mechanism of electron- 
coupled proton translocation. 
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METHODS 


Protein preparation. Complex I was purified from B. taurus heart mitochondrial 
membranes by solubilization and anion exchange chromatography in n-dodecyl-B- 
D-maltoside (DDM), and size-exclusion chromatography in DDM or 7-cyclohexyl- 
1-heptyl-B-p-maltoside (Cymal 7) as described previously”’. 

Cryo-EM specimen preparation and imaging. Aliquots of complex I (3 ul, 3- 
4.5 mg ml _') were applied to glow-discharged holey-carbon Quantifoil R 0.6/1 grids, 
blotted for 15-18 s, then plunge-frozen in liquid ethane using an environmental 
plunge-freeze apparatus”’. The grids were transferred into cartridges, loaded into 
an FEI Titan Krios electron microscope, and images were recorded at 2-5 1m under- 
focus on a Falcon II CMOS (complementary metal oxide semiconductor) direct 
electron detector at 300 keV at X81,495 magnification (nominally < 47,000), with 
the specimen temperature at — 186 °C using the EPU software (Extended Data Fig. 1). 
The detector pixel size of 14 um corresponds to a sampling density of ~1.72 A 
pixel” '. Each image was exposed for 4 s (total dose ~64e A”) and 72 frames were 
captured as previously described’>. For tilt-pair analysis, the same area was imaged 
twice, at 0° for 0.8s and then at 10° for 2.0, and analysed with FREALIGN”. 
Image processing and three-dimensional reconstruction. An initial data set for 
complex I in DDM was obtained by manually picking particles with XIMDISP*. 
Seven-thousand six-hundred and thirty particles, from 366 micrographs, were used 
to generate initial maps in EMAN2 (ref. 44). Particles were boxed in 280 X 280 pixels 
and contrast transfer function (CTF) parameters estimated internally. Reference- 
free classification was performed using the default EMAN2 parameters, and classes 
with distinct orientations selected (see Extended Data Fig. 1b for an example) to build 
initial models. The initial model that best matched the class averages was selected 
and two cycles of refinement performed in EMAN2. Subsequently, comparison of 
the model with the structure of complex I from T. thermophilus (Protein Data Bank 
(PDB) accession 4HEA*) suggested that it had the wrong hand; this observation 
was verified using tilt-pair analysis* and corrected (Extended Data Fig. 2). All fur- 
ther refinements were performed in RELION™, starting with maps that were low- 
pass filtered to 60 A. 

Typical micrographs prepared from complex I in Cymal 7 exhibited, on aver- 
age, twice as many particles (~40 per micrograph) than those from complex I in 
DDM, so a larger data set was collected using Cymal 7. The reason for the differ- 
ence in particle distribution is not clear—it may simply be a product of the grid 
preparation and freezing protocols. The class averages were used as a reference to 
pick particles automatically using RELION but many false positives were included, 
so all the images were inspected manually and particles too close to each other, 
aggregates and ice contaminants were deleted. The final data set contained 45,618 
particles from 1,154 micrographs. The CTF was determined with CTFFIND3 (ref. 46) 
using the images summed from all 72 frames. Subsequently, refinement was per- 
formed using frames 1-32 of each image (the last 40 frames were discarded), to pro- 
duce a map with resolution of 5.86 A and orientational accuracy of 1.2°. To check 
for overfitting, phases were randomized beyond 10 A on individual images (frames 
1-32), followed by refinement as for the normal images’. The results clearly show 
the presence of information beyond 10 A (Extended Data Fig. 2). Note that RELION 
divides the data set into two halves at the initial step and calculates the resolution 
using a gold-standard Fourier shell correlation (FSC), so the phase randomization 
procedure serves here only as an additional control. 

Modelling of the beam-induced movement of the complex I particles (using a 
running average of 11 movie frames in RELION) provided a modest improvement 
in resolution to 5.16 A. The parameters from this analysis were then used to carry 
out a per-frame reconstruction in RELION (particle-polish), and a B-factor weight- 
ing was applied to each frame, resulting in a 4.8 A map. The B-factor-weighted 
particles were subjected to 25 iterations of three-dimensional classification into 4 
classes; this separated a major class containing 55% of the particles from smaller 
classes of 24, 14 and 6%. Difference maps revealed minor localized variations in 
some of the peripheral regions of the molecule, but no large-scale conformational 
variation was observed. The major class with 25,492 particles was refined and, after 
post-processing with RELION, a shape-mask, correction for the modulation trans- 
fer function (MTF) of the detector and a B-factor of — 152 (ref. 48) were applied, 
and filtered to 4.95 A resolution (Extended Data Fig. 2C, note that the magnifica- 
tion and CTF values have not been refined). Despite containing a lower number of 
particles, the maps from this major class and the whole data set were comparable. 
Analysis of the local resolution by ResMap*” showed that the core sections of the 
molecule, particularly the TMHs, have higher resolutions than the peripheral sec- 
tions, and that (as expected) the detergent-phospholipid belt is at lower resolution 
(Extended Data Fig. 2e). 

Model building. Model building was performed using Coot*’. All the models de- 
scribed have been built as polyalanine chains, except for the residues that coordinate 
the FeS clusters. Note that the present model has not been refined, so it inevitably 
contains some errors and inaccuracies. Examples of the model fitted to the electron 
density are shown in Extended Data Fig. 3, and figures were created using the 
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PyMOL Molecular Graphics System or UCSF Chimera (http://www.cgl.ucsf.edu/ 
chimera/). 

The seven core hydrophilic subunits of B. taurus complex I were modelled ini- 
tially using the coordinates of T. thermophilus complex I (PDB accession 4HEA‘) 
as a template, trimmed where the densities were ambiguous, and adjusted manu- 
ally. FeS clusters were located using the highest peak densities in the unsharpened 
map, and the subunits were built around them. The 24 and 51 kDa subunits were 
easily built as they are well conserved and the connectivity in the densities is clearly 
resolved. The 49 kDa subunit has dominant secondary structures and, except for 
the N-terminal peptide, could be completely traced. Similarly, the PSST and TYKY 
subunits, with three FeS clusters, were readily built. The central part of the 30 kDa 
subunit, containing a mixture of a-helices and B-strands, could be traced, but the 
path of the long unstructured C terminus is unclear. The 75 kDa subunit is the least 
conserved hydrophilic core subunit. The small domain containing the three FeS 
clusters is well resolved and could be traced easily using the T. thermophilus model, 
but considerable portions of the large, peripheral domain have poor density, low 
secondary structure content and low sequence similarity to T. thermophilus, and so 
could not be traced confidently. The seven core subunits in the membrane domain 
could all be readily traced, except for a few loop regions, and assigned using their 
similarity to the T. thermophilus subunits. The long transverse helix at the C ter- 
minus of subunit NDS is well ordered and extends over ND4 and ND2. In better 
resolved regions of the map, protruding densities that are likely to be side chains of 
aromatic residues are observed (see Extended Data Fig. 3), and these features, along 
with secondary structure information and sequence alignments, were used to pro- 
duce an optimized assignment for the B. taurus residue numbers in the modelled 
subunits (Extended Data Table 2), for use as a guide to the positions of individual 
residues. 

Once the electron density for the core subunits had been assigned, models for 
the TMHs of the supernumerary subunits were built. A total of 18 TMHs were 
modelled, and when the density was clear they were extended. Connectivity was 
observed between four TMHs adjacent to subunit ND4 so they were combined into 
a single chain. To aid in supernumerary subunit assignments, the secondary struc- 
ture of each subunit was predicted using PSIPRED* and TMHs were predicted 
using TMHMM2 (ref. 52), HMMTOP2 (ref. 53) and the TOPCONS suite™* (seven 
methods in total) (Extended Data Table 4). Known structures of soluble proteins 
with high homology to the complex I supernumerary subunits were identified by 
HHpred* and used to build homology models in Modeller*® and SwissModel” (Ex- 
tended Data Table 5). Regions of the density map with features corresponding to 
the predicted structures were located manually. Long loop regions were trimmed, 
then the models were placed in the density, jiggle fit in Coot was used to find the 
best fit, and the models were adjusted manually. Finally, several additional tubular 
densities in the map were built as «-helices. Most of them are located close to 
TMHs from the supernumerary subunits, but the connectivity to them is not clear; 
a higher resolution map will be necessary to assign these helices to their respective 
subunits. 
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Extended Data Figure 1 | Single-particle cryo-EM analysis of B. taurus 
complex I. a, Typical micrograph of complex I particles imaged after freezing 
in vitreous ice on a holey-carbon grid. Some of the selected particles are 
marked with red boxes. Scale bar, 50 nm. b, Two-dimensional reference 
classification showing particles lying in different orientations in the ice. The size 
of each box is 280 pixels and the two-dimensional classification was made 

in RELION™. 
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Extended Data Figure 2 | Validation of the map and resolution. a, Tilt-pair 
analysis** of complex I in Cymal-7. One-hundred complex I particles from 
eight image pairs, recorded with a relative tilt angle of 10°, were extracted 
and subjected to tilt-pair analysis with FREALIGN™. The outer radius of the 
plot is 40° and the orange circle centred at the expected tilt angle has a radius of 
6°. b, Phase randomization to check for overfitting. Phases that are beyond 
10A in each of the micrographs used in the final data set (frames 1-32) were 
randomized, and then refinement was performed as for a normal data set 
(FSC summed image corresponding to frames 1-32). As expected, the graph 
shows a drop in the Fourier shell correlation (FSC) curve at 10 A, validating the 
presence of information beyond 10 A in the images. Note that the use of 
gold-standard refinement procedures in RELION™ prevents any overfitting, 
and this test was done only as an additional control. c, An overview of the final 
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map and the model built into it. d, FSC curves of the final map and of the 
model versus the map. The curve in red is the gold-standard FSC of the final 
map (after classification) and the resolution at FSC = 0.143 is ~4.95 A. The 
curve in cyan is the FSC between the final map and the model, and at FSC = 0.5 
the resolution is 6.7 A. Note that the present model is not complete since 

it is only a polyalanine model without any side chains, and loop regions in a 
number of subunits have not been modelled. e, The final map of mammalian 
complex I was analysed with ResMap™. The left-hand panel (with lower 
density threshold) shows that the detergent-phospholipid belt is of lower 
resolution, and most of the protein regions of the map show resolution 
distributed from 5 to 6 A. In the right-hand panel the map is shown at a higher 
density threshold, so the detergent-phospholipid belt is not visualized. Some of 
the interior parts of the map have resolution of 4.8-5 A. 
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Extended Data Figure 3 | Example regions of the density map with the 
model fitted to the map. a, ND2 is shown from the membrane plane, 
highlighting the densities for three aromatic side chains and one of the 
helix-breaking loops. b, Subunit ND4 viewed from the matrix. c, The density for 
a [4Fe-4S] cluster and surrounding protein is shown in the PSST subunit. 

d, A region of the 49 kDa subunit shows a well resolved «-helical stretch and 
aromatic side chains, and the B-strands are beginning to be resolved. e, Subunit 
B8 is an example of a supernumerary subunit in a peripheral region of the 
molecule. f, Density consistent with a bound nucleotide is observed in the 

39 kDa subunit, in a similar position to in homologous structures and as 
expected from analysis of Y. lipolytica complex I (ref. 27). However, the present 
resolution of the map precludes the inclusion of this nucleotide in the 

final model. 
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(NDUFV1 & FV2) 


Extended Data Figure 4 | Global comparison of the core subunit structures 
of bacterial and mammalian complex I. The core subunits from B. taurus are 
in blue, and from T. thermophilus (PDB accession 4HEA") in orange. The 
structures have been superimposed using ND1 (the heel subunit). Top: the 
ND2, ND4 and ND5 domain is rotated in B. taurus relative to in T. 
thermophilus, increasing the curvature in the B. taurus membrane domain. The 
complex is viewed along the 11° rotation vector (orange) that maps the T. 
thermophilus ND2, ND4 and ND5 domain to the B. taurus domain, along with 
a small 5 A translation to superimpose the domain centres. Correspondingly, 
the ND3, ND4L and ND6 domains are superimposed by a 4° rotation and a 
1 A translation. Rotation of ND2, 4 and 5 about the long axis of the domain, as 
noted for Y. lipolytica®, is not observed. Bottom: the NADH dehydrogenase 
domain containing the 51 and 24 kDa subunits is rotated by 23° and translated 
by 14A in B. taurus, relative to in T. thermophilus, causing the FeS chains to 
diverge as the distance from ND1 increases. A similar rotation was observed in 
Y. lipolytica®*. The complex is viewed from behind ND1. Correspondingly, 
the 49 kDa, PSST and TYKY subunits are superimposed by a 6° rotation and a 
2A translation. The structures were analysed using Superpose from the 
CCP4 suite” and the 75 kDa and 30 kDa subunits were not included due to 
their lower structural conservation. 
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Extended Data Figure 5 | Comparison of the individual structures of the differences in the structures of the core subunits of B. taurus and 

core subunits of bacterial and mammalian complex I. a, The structure of T. thermophilus complexes I. Grey, conserved structure from B. taurus and 
each subunit from T. thermophilus (wheat) (PDB accession 4HEA‘*) has T. thermophilus (PDB accession 4HEA‘); red, structural elements present only 
been superimposed separately on its corresponding subunit from B. taurus in T. thermophilus; blue, structural elements present only in B. taurus. The 
(coloured as labelled) with the transverse helix plus TMH16 of ND5 also C-terminal domain of the 75 kDa subunit is not resolved in B. taurus, but its 


aligned separately. The complexes are viewed from behind ND1 (top), from the structure is clearly different to in T. thermophilus. 
side (middle) and from the matrix (bottom, ND subunits only). b, Observed 
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Extended Data Table 1 | Reference table for the nomenclature of the core subunits of complex | 


Chain Homo Yarrowia Thermus Escherichia 

Domain identifier Bos taurus sapiens lipolytica thermophilus coli 
G 75 kDa NDUFS1 NUAM Nqo3 NuoG 
F 51 kDa NDUFV1 NUBM Nqo1 NuoF 
D 49 kDa NDUFS2 NUCM Nqo4 

iyaop hie Cc 30 kD NDUFS3 NUGM - 5 eee 

domain a qo 
E 24 kDa NDUFV2 NUHM Nqo2 NuoE 
B PSST NDUFS7 NUKM Nqo6 NuoB 
I TYKY NDUFS8 NUIM Nqo9 Nuol 
H ND1 ND1 NU1M Nqo8 NuoH 
N ND2 ND2 NU2M Nqo14 NuoN 
A ND3 ND3 NU3M Nqo7 NuoA 

Membrane 

domain M ND4 ND4 NU4M Nqo13 NuoM 
L ND5 ND5 NU5M Nqo12 NuoL 
J ND6 ND6 NU6M Nqo10 NuoJ 
K ND4L ND4L NULM Nqo11 NuokK 


In the text the names of the subunits from B. taurus are used, with the names from the human enzyme presented alongside as appropriate. 
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Extended Data Table 2 | Summary of the models of the core subunits of B. taurus complex | 


Subunit 


ND1 


ND2 


ND3 


ND4 


ND4L 


NDS 


ND6 


75 kDa 
NDUFS1 


51 kDa 
NDUFV1 


49 kDa 


NDUFS2 


30 kDa 
NDUFS3 


24 kDa 
NDUFV2 


PSST 
NDUFS7 


TYKY 
NDUFS8 


Total 
residues 


318 


347 


115 


459 


98 


606 


175 


704 


444 


430 


228 


217 


179 


176 


Modelled 
residues 


3 - 200 
219 - 242 
253 - 315 


2-300 
320 - 346 


4-22 

28 - 358 
363 - 400 
408 - 466 
487 - 513 
520 - 604 


2-76 
85 - 107 
140 - 172 


8-125 

136 - 318 
326 - 347 
367 - 400 
404 - 410 
425 - 495 
525 - 530 
542 - 627 


31-441 


47 - 430 


15 - 168 


20-178 


27 - 169 


15- 176 


Poorly resolved / uncertain 
residue numbering 


TMH11 


TMH14 


TMH1 


TMH13 & TMH14 
TMH15 
Transverse helix & TMH16 


TMHS 


The large domain (222 - 704) 

is generally poorly resolved. 

The sequence alignment is 

weak and the secondary structure 
content low. Residues 404 - 629 
are particularly poorly resolved. 


Flavin and NADH binding site 
(63 - 72, 99 - 104, 181 - 189, 
300 - 304, 327 - 333) 
3-strand B-sheet (47 - 79) 
Numbering uncertain to 72 


Loop / B-strand (73 - 83) 


Loop 126 - 132 


Loop 68 - 79 


Unresolved 
residues 


1-2 

201 - 218 
243 - 252 
316 - 318 


1 
301 - 319 
347 


1 
24-51 
113 - 115 


1-2 
416 - 429 
456 - 459 


85 - 98 


77 - 84 
108 - 139 
173-175 


1-7 

126 - 135 
319 - 325 
348 - 366 
400 - 403 
411 - 424 
496 - 524 
531 - 541 
628 - 704 


1-30 


442 - 444 


1-46 


1-14 
169 - 228 


1-19 
179-217 


1-26 
170 - 179 


1-14 


*For proteins with a mitochondrial-targeting pre-sequence, residue 1 is the first residue of the mature protein?°. 


+ The percentage identity and the root mean squared deviation (r.m.s.d., calculated using PDBeFOLD*®) are between the sequences and structures of the subunits of B. taurus and T. thermophilus (PDB accession 


AHEA) complex |. 


Unresolved 
elements 
(>10 residues) 


Matrix loop (TMH 5 - 6) 


IMS loop (TMH 6 - 7) 


Matrix loop (TMH 10 - 11) 


Matrix loop (TMH 1 - 2) 


Matrix loop (TMH 13 - 14) 


Matrix loop (C-terminus) 


Matrix loop (TMH 1-2) 

Matrix loop (TMH 11-12) 

IMS loop (TMH 12-13) 

IMS loop (TMH 14-15) 

TMH‘15 to transverse 
helix 


Matrix loop (TMH 3 - 4) 
IMS loop (TMH 4 - 5) 


Probable loop region 
Probable loop region 
Probable loop region 
Probable loop region 
Probable subdomain 


N-terminal peptide 


N-terminal region 


N-terminal peptide 
C-terminal region 


N-terminal peptide 
C-terminal region 


N-terminal peptide 


N-terminal peptide 
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%Modelled 


90% 
(285/318) 


94% 
(326/347) 


72% 
(83/115) 


96% 
(439/459) 


86% 
(84/98) 


92% 
(558/606) 


75% 
(131/175) 


75% 
(527/704) 


1 - 221: 
92% 


222 - 704: 
67% 


93% 
(411/444) 


89% 
(384/430) 


68% 
(154/228) 


73% 
(159/217) 


80% 
(143/179) 


92% 
(162/176) 


ARTICLE 


%ldentityt 


42% 
(132/318) 


25% 
(86/347) 


27% 
(31/115) 


24% 
(111/459) 


21% 
(21/98) 


31% 
(187/606) 


16% 
(28/175) 


27% 
(189/704) 


1-221 
40% 


222 - 704: 
21% 


43% 
(191/444) 


42% 
(179/430) 


24% 
(54/228) 


27% 
(59/217) 


49% 
(88/179) 


36% 
(63/176) 


r.m.s.d.t 


1.60A 


2.08 A 


2.05 A 


2.20A 


2.66A 


2.53A 


1.83 A 


1.96 A 
1-221: 
157A 
222 - 704: 
211A 


1.61A 


141A 


1.66A 


1.57A 


144A 


1.89A 


ARTICLE 


Extended Data Table 3 | Distances between the redox cofactors in structural models of complex | 


T. thermophilus B. taurus 
hydrophilic domain complex | complex | 
(2FUG.pdb’) (4HEA.pdb*) (this work) 
Cofactors* centret edget centre? edget centret edget 
Nia - Flavin 15.4 123 15.9 13.1 15.9t 13.14 
Flavin - cluster 1 (N3) 12.5 7.6 12.2 7.3 12.2% 7.24 
Nia - cluster 1 (N3) 22.1 19.4 22.3 19.7 21.1 18.0 
Cluster 1 (N3) - cluster 2 14.0 11.0 13.7 10.7 14.0 11.0 
Cluster 1 (N3) - cluster 3 17.4 13.8 17.1 13.4 18.4 14.5 
Cluster 2 - cluster 3 13.5 10.7 13.0 9.9 127 o7 
Cluster 3 - cluster 4 12.2 8.5 12.4 8.6 12.8 8.7 
Cluster 4 - cluster 5 16.8 14.0 16.5 13.6 16.8 14.0 
Cluster 5 - cluster 6 124 9.4 12.1 9.3 12.1 9.3 
Cluster 6 - cluster 7 (N2) 13.7 10.5 13.5 10.2 13.6 10.5 
Cluster 1 (N3) - cluster 7 (N2) 61.1 57.6 60.5 57.0 61.5 58.1 


* The [2Fe-2S] cluster in the 24 kDa subunit (known as N1a) is on the other side of the flavin from the main cofactor chain. The [4Fe-4S] cluster in the 51 kDa subunit (known as N3) is the first cluster in the chain 
and the [4Fe—-4S] cluster in subunit PSST (known as N2) is the last (Seventh) cluster in the chain. 

+The distances are in A, between the geometric centres of the Fe and S cluster cores or the flavin isoalloxazine ring system (centre), or between the centres of the two closest atoms (edge) as commonly used in 
calculations of electron transfer rates. Distances are estimated to be accurate to within 1A. 

£The position of the flavin in B. taurus is poorly resolved and has been approximated using its position in PDB accession 4HEA. 
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Extended Data Table 4 | Knowledge about the supernumerary subunits of B. taurus complex | 


B. taurus  H. sapiens : Predicted 

subunit* subunit* Subcomplex’ Sequence information TMHs+ 

10 kDa NDUFV3 la and IA 0 

18 kDa NDUFS4 la and IA 0 

15 kDa NDUFS5 la only CXC motif, intermembrane space?” 0 

13 kDa NDUFS6 la and IA PFAM zinc-finger motif 0 
CXgHX,,CX,C 

MWFE NDUFA1 la only 1 

B8 NDUFA2 la and IA 0 

BO NDUFA3 la only 1 

B13 NDUFAS la and IA 0 

B14 NDUFA6 la only LYR motifs2 0 

B14.5a NDUFA7 la and IA 0 

PGIV NDUFA8 la only Two CX,C motifs, PFAMCHCH domain 0 
intermembrane space?” 

39 kDa NDUFAQ la only Short-chain dehydrogenase 0 
reductase family, NADP binding2627 

42 kDa NDUFA10 la only (low level) — Similarity to deoxynucleoside 0 
kinases24 

B14.7 NDUFA11 la (IA at low level) 3 or 4 

B17.2 NDUFA12 la and IA 0 

B16.6 NDUFA13 la and IA 1 

SDAP NDUFAB1 both la and Ip Acyl-carrier protein29:30 0 

MNLL NDUFB1 Ip 0 (or 1) 

AGGG NDUFB2 Ip 1 (or 0) 

B12 NDUFB3 Ip ‘l 

B15 NDUFB4 both la and Ip 1 

SGDH NDUFBS5 Ip 1 

B17 NDUFB6 Ip ‘l 

B18 NDUFB7 Ip CX,C motif, intermembrane space” 0 

ASHI NDUFB8 Ip 1 

B22 NDUFB9 Ip LYR motif92 0 

PDSW NDUFB10 Ip 0 

ESss NDUFB11 Ip 1 

KFYI NDUFC1 none 1 

B14.5b NDUFC2 IB (low level) 1or2 


*The former subunit MLRQ (NDUFA4) is no longer considered a subunit of complex | (ref. 60). 
+ Subcomplex Id, which contains the 7 hydrophilic core subunits and 8-9 supernumerary subunits, represents a considerable portion of the hydrophilic domain of complex |. Subcomplex la, which contains all the 
subunits of subcomplex IA plus core subunit ND6 and 9-10 additional supernumerary subunits, represents the hydrophilic domain of complex | plus associated membrane subunits. Subcomplex IB, which 

contains ND4 and ND5 and 12-13 supernumerary subunits, represents part of the membrane domain’. 
+ TMHs were predicted using TMHMM2 (ref. 52), HMMTOP2 (ref. 53) and the TOPCONS suite®* (seven methods in total) and are presented as consensus values with less represented values in brackets and single 


outliers discarded. 
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Extended Data Table 5 | Summary of the models of the supernumerary subunits of B. taurus complex | 


Subunit 


42 kDa 
NDUFA10 


39 kDa 
NDUFA9Q 


18 kDa& 
NDUFS4 


13 kDa8 
NDUFS6 


B8 
NDUFA2 


SDAP-o. 
NDUFAB1 


SDAP-B 
NDUFAB1 


B138 
NDUFAS5 


B14 
NDUFA6 


PGIV 
NDUFA8 


B14.7 
NDUFA11 


B16.6 
NDUFA13 


B9§ 
NDUFA3 
or 
MWEFES 
NDUFA1 


*For proteins with a mitochondrial-targeting pre-sequence, residue 1 is the first residue of the mature protein??. 


Chain 
identifier 


oO 


Total 
residues* 


320 


345 


133 


96 


99 


88 


88 


116 


128 


AZ 


141 


144 


154 


PDB modelt 


20CP®! 


2Q1Wé2 


2JYA 


2JRR 


1S3A20 


1F 8083 


1F80% 


2LQL°° 


Aligned 
residues 


21 - 252 


19 - 325 


33 - 133 


44-96 


1-99 


35 - 114 


+ Known structures with high homology to the complex | subunits”® °& ®-© were identified by HHpred®°. 
{The percentage identity and the r.m.s.d. (calculated using PDBeFOLD*®) are between the sequences and structures of the subunits of B. taurus complex | and the PDB models. 


§ Subunit with less certain assignment. 


Residue numbers are arbitrary and not assigned to the sequence. 


%ldentity 


21% 
(49/232) 


13% 
(41/307) 


37% 
(37/101) 


34% 
(18/53) 


94% 
(93/99) 


36% 
(28/77) 


36% 
(28/77) 


23% 
(18/80) 


Modelled 
residues 


22 - 54 

79 - 167 
172 - 210 
222 - 241 
19-185 
203 - 250 
285 - 321 


33 - 59 
76 - 116 
47 - 93 


17 - 96 


1-801 


1-1061 


33 - 97 


1-291 


1-429 
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%Modelled 


57% 
(181/320) 


73% 
(252/345) 


52% 
(69/133) 


49% 
(47/96) 


81% 
(80/99) 


81% 
(71/88) 


85% 
(75/88) 


61% 
(71/116) 


56% 
(72/128) 


46% 
(79/172) 


75% 
(106/141) 


45% 
(65/144) 


46% 
(71/154) 


r.m.s.d.# 


191A 


2.52A 


2.42A 


1.97A 


2.18A 


1.18A 


1.36A 


2.40A 
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Turbulent heating in galaxy clusters brightest 


in X-rays 


I. Zhuravleva’, E. Churazov?*, A. A. Schekochihin®”®, S. W. Allen’, P. Arévalo®’, A. C. Fabian!®, W. R. Forman", J. S. Sanders’”, 


A. Simionescu!’, R. Sunyaev*, A. Vikhlinin'' & N. Werner’? 


The hot (10” to 10° kelvin), X-ray-emitting intracluster medium (ICM) 
is the dominant baryonic constituent of clusters of galaxies. In the 
cores of many clusters, radiative energy losses from the ICM occur 
on timescales much shorter than the age of the system’ *. Unchecked, 
this cooling would lead to massive accumulations of cold gas and 
vigorous star formation’, in contradiction to observations’. Various 
sources of energy capable of compensating for these cooling losses 
have been proposed, the most promising being heating by the super- 
massive black holes in the central galaxies, through inflation of bub- 
bles of relativistic plasma®”. Regardless of the original source of energy, 
the question of how this energy is transferred to the ICM remains 
open. Here we present a plausible solution to this question based on 
deep X-ray data and a new data analysis method that enable us to eval- 
uate directly the ICM heating rate from the dissipation of turbulence. 
We find that turbulent heating is sufficient to offset radiative cool- 
ing and indeed appears to balance it locally at each radius—it may 
therefore be the key element in resolving the gas cooling problem 
in cluster cores and, more universally, in the atmospheres of X-ray- 
emitting, gas-rich systems on scales from galaxy clusters to groups 
and elliptical galaxies. 

Perseus and Virgo (also known as M87) are well-studied, nearby, cool- 
core clusters of galaxies in which the central cooling times, owing to the 
emission of X-rays, are an order of magnitude shorter than the Hubble 
time (Methods and Extended Data Fig. 1). X-ray observations show that 
the ICM in central regions of these clusters is disturbed, suggesting that 
it might be turbulent. The most likely drivers of this turbulence are mechan- 
ically powerful active galactic nuclei (AGNs) in the central galaxies of 
both clusters, which inflate bubbles of relativistic plasma in the ICM. 
During the inflation and subsequent buoyant rise of these bubbles, internal 
waves and turbulent motion in the gas can be excited'*’ and must even- 
tually dissipate into heat. To determine whether this heating is sufficient 
to balance radiative losses and prevent net cooling, one must estimate the 
turbulent heating rate—and for that, a measurement is needed of the 
root mean squared turbulent velocity amplitude V as a function of length 
scale I. Then the turbulent heating rate in the gas with mass density p is 
(dimensionally) Quit ~ pV°/l to within some constant of order unity 
that depends on the exact properties of the turbulent cascade. (We use 
a tilde between two quantities to indicate order-of-magnitude equiva- 
lence.) Qturb has never previously been probed directly, mainly because 
of two difficulties. In this Letter, we propose ways of overcoming both, 
leading to an observational estimate of Qiu and the tentative conclu- 
sion that it is sufficient to reheat the ICM. 

The energy resolution of current X-ray observatories is insufficient 
to measure gas velocities in the ICM, or their dependence on scale. Here 
we circumvent this problem by instead measuring gas density fluctua- 
tions and inferring from their power spectrum the power spectrum of 


the velocities. A simple theoretical argument, supported by numerical 
simulations, shows that in relaxed galaxy clusters, where the gas motions 
are subsonic, the root mean squared amplitudes of the density and one- 
component velocity fluctuations are proportional to each other on each 
scale ! = k~' within the inertial range'*"*: 5;/py ~ 11, Vi,4/Cg) Where po 
is the mean gas density, c, is the sound speed and 1, is the proportion- 
ality coefficient ~1 set by gravity wave physics on large, buoyancy- 
dominated scales'?. Here we define Vi, by 3Vi,4°/2 = k,E(k,), where 
k, = 2rk is the Fourier wave number and E(k) is the energy spectrum 
of the three-dimensional velocity field; 5p;/po is defined analogously 
in terms of the density fluctuation spectrum, but without the factor of 
3/2. Unsharp-masked images of the Perseus cluster show ripple-like 
structures in the core, reminiscent of either sound waves!" or stratified 
turbulence’*"” (Methods). Here we investigate the consequences of the 
second of these possibilities (which may be more likely if the stirring of 
the ICM by the AGN ejecta is of sufficiently low frequency). 

The high statistical precision obtained by Chandra ina 1.4 Ms obser- 
vation of the Perseus cluster core makes this data set ideal for probing 
density structures over a range of spatial scales. Figure 1 shows the mosaic 
image and a residual image, made by dividing the mosaic image by a 
spherically symmetric $-model of the mean intensity profile with core 
radius 1.26’ (equivalent to ~26 kpc at the distance of Perseus) and slope 
6 =0.53 (Methods and Extended Data Fig. 2). Using the modified 
A-variance method’, we calculate the power spectra of surface bright- 
ness fluctuations in a set of concentric annuli (Extended Data Fig. 3), 
each with width 1.5’ (31 kpc), and deduce from them the amplitudes of 
density fluctuations across a range of spatial scales. The typical 6p;/po 
at k~' ~ 20 kpc varies from ~20% inside the central 1.5’ (31 kpc) to 
~7% at an angular distance of 10.5’ (~218 kpc) from the cluster centre 
(1.Z. et al., manuscript in preparation). We have also performed a similar 
analysis for a ~600 ks Chandra observation of the Virgo cluster. 

Figure 2 shows examples of the velocity amplitudes Vj, inferred from 
the density amplitudes 5p;/po using the relation 7, Vi4/c., ~ 5p;/pos in 
two different annuli for each of the two clusters. In these examples, over 
the range of spatial scales where the measurements are least affected by 
systematic and statistical uncertainties, V; , varies from ~70 km s ‘to 
~145kms ' in Perseus. In the full set of seven annuli from the centre 
to 10.5’ (~218 kpc), the range of velocities is larger, up to210 kms — lin 
Virgo, the typical velocity amplitudes in all annuli are smaller, between 
43 and 140kms_', but the corresponding spatial scales are smaller too. 

These (inferred) velocity spectra can be used to estimate the heating 
rate Quirb ~ PV°/I. The second difficulty mentioned earlier is that nor- 
mally [here is taken to be the energy-containing scale of the turbulence, 
which is difficult to determine or even define unambiguously: in theory, 
several characteristic scales (for example the distance from the centre, 
various scale heights and the like) are present in the problem”. The 
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Figure 1 | X-ray image of the core of the Perseus cluster. a, X-ray surface 
brightness in units of counts per second per pixel (colour scale), obtained in the 
0.5-3.5 keV energy band from Chandra observations. b, The same divided 
by the mean surface brightness profile, highlighting the relative density 
fluctuations. The images are smoothed with a 3’’ Gaussian. Black circles: 


measured spectra (Fig. 2) do not necessarily offer clarity about the injection 
scale, because at low k they are dominated by large-scale inhomogene- 
ities and the radial width of the chosen annuli. However, in a turbulent 
cascade, the energy spectrum in the inertial range should have a universal 


Virgo cluster 


100 =. 


One-component velocity amplitude (km s~') 


1 
0.1 


Wavenumber (kpc~') 


Figure 2 | Measured amplitude of the one-component velocity Vi,x of gas 
motions versus wavenumber k. The amplitude is shown for two different 
annuli in both Perseus (blue) and Virgo (red). The values are obtained from the 
power spectra of density fluctuations, derived from the X-ray images. The 
wavenumber k is related to the spatial scale ! by k = 1/1. Solid-hatched regions 
show the range of scales where the measurements are least affected by 
systematic and statistical uncertainties (Methods). The width of each curve 
reflects the estimated 1c statistical and stochastic uncertainties. The dashed line 
is the Kolmogorov scaling k”"°. 
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excised point sources (Methods). The redshift is taken to be z = 0.01756 
(redshift of the central galaxy); hence, the angular diameter distance is 72 Mpc 
(for h = 0.72, Qy = 0.3, Qa = 0.7) and 1’ corresponds to a length scale 

of 20.82 kpc. 


form depending only on k and the mean, density-normalized dissipa- 
tion rate é = Qiup/Po. Assuming purely hydrodynamic” turbulence, 
the energy spectrum should be E(k,) = Cxe7?k, °°, where Cx ~ 1.65 is 
the Kolmogorov constant*’”*. The turbulent energy flux on any scale in 
the inertial range will be the same and equal to the mean dissipation rate: 
accounting for our conventions k = 1/1 = k,/2mand V1, = [2k,E(k,)/ 3], 
we obtain Quy = Poe = CopoV ia ks where Cg = 3°72m1/(2Cx)*” = 5 
is a dimensionless constant whose value should be treated as a fiducial 
number. Indeed, although the constant-flux, Kolmogorov-like nature 
of the turbulence is probably a good assumption, the specific constant 
Cg will depend on more detailed properties of the turbulent cascade 
(for example magnetohydrodynamic rather than hydrodynamic”’) and, 
in particular, on the types of fluctuation (velocity, magnetic, density”) 
that carry the total injected energy flux to small scales. We will not be 
concerned here with a precise determination of Cg. It is clearly an order- 
unity number and it is also clear that our estimate for the turbulent heat- 
ing rate can be used only if we identify, for each of the annuli where we 
calculated V;,,,a k interval in which Vj, wk stays approximately constant 
in k. Remarkably, our measured velocities are indeed consistent with 
Vipw~k ', accounting for the errors and uncertainties associated with 
finite resolution and with our method of extracting power spectra’. 

Because of order-unity uncertainties in the determination of Quip 
the question of the heating—cooling balance reduces to whether the local 
Qturb Measured at each radius is comparable within an order of magni- 
tude to the local cooling rate and, more importantly, scales linearly with 
it from radius to radius and between clusters. The answer, as demonstrated 
in Fig. 3, is ‘yes’. Here the gas cooling rate was evaluated directly from 
the measured gas density and temperature T: Qooo1 = Neti Ay (T), where 
n, and n; are the number densities of electrons and ions, respectively, 
and A,,(T) is the normalized gas cooling function’’. We see that, in all 
seven annuli in Perseus and all four in Virgo (which span the cluster 
cores in both cases), Qturb ~ Qcooi over nearly three orders of magnitude 
in the value of either rate (Fig. 3 and Methods). We note that in Virgo 
and Perseus similar levels of Q.go1 and Qrurb are attained at physically 
different distances from the cluster centres. 
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Figure 3 | Turbulent heating (Qheat) versus gas cooling (Q.o01) rates in the 
Perseus and Virgo cores. Each shaded region shows the heating and cooling 
rates estimated in a given annulus (top right, the innermost radius; bottom 
left, the outermost radius; Extended Data Fig. 3). The size of each rectangle 
reflects 1o statistical and stochastic uncertainties in heating, variations of the 
mean gas density and temperature across each annulus (affecting estimates of 
both cooling and heating) and the deviations of the measured spectral slope 
from the Kolmogorov law. 


Although these results are encouraging, the uncertainties associated 
with the above analysis are, admittedly, large (Methods). It is difficult to 
prove unambiguously that we are dealing with a universal turbulent cas- 
cade, because other structures (for example edges of the bubbles, entrain- 
ment of hot bubble matter’’, sound waves'*"®, mergers and gas sloshing”) 
might also contribute to the observed density fluctuation spectra. Rather 
we argue simply that the cluster cores appear disturbed enough that if these 
disturbances are indeed due to turbulence, then its dissipation can reheat 
the gas. At the very least, one may treat the amplitudes calculated here 
(Fig. 2) as an upper limit on the turbulent velocities. One of the major tasks 
for future X-ray observatories, capable of measuring the line-of-sight gas 
velocities directly, will be to verify the accuracy of these velocity amplitudes. 

With this caveat, the approximate balance of cooling and heating 
(Fig. 3) suggests that turbulent dissipation may be the key mechanism 
responsible for compensating for gas cooling losses and keeping cluster 
cores in an approximate steady state. Although AGN activity is not the 
only driver of gas motions (mergers or galaxy wakes can contribute as 
well’*), it is plausible that AGNs have the dominant role in the central 
~100 kpc, where the cooling time is short. If this is true then our results 
support the self-regulated AGN feedback model”, in which unchecked 
cooling causes accelerated accretion onto the central black hole, which 
responds by increasing the mechanical output, presumably in the form 
of bubbles of relativistic plasma. The bubbles then rise buoyantly, excit- 
ing internal waves in particular’; the energy from them is converted 
into turbulence, which cascades to small scales and eventually dissipates, 
reheating the gas. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Data processing. We use Chandra data ObsIDs 3209, 4289, 4946-4953, 6139, 6145, 
6146, 11713-11716, 12025 and 12033-12037 for the Perseus cluster and ObsIDs 
2707, 3717, 5826-5828, 6186, 7210-7212 and 11783 for the Virgo cluster to extract 
projected density fluctuation spectra in a set of radial annuli. The initial data pro- 
cessing has been done following the standard procedure”, applying the most recent 
calibration data. To obtain the thermodynamic properties of both clusters, the spectra 
are deprojected” and fitted in the 0.6-9 keV band, using the XSPEC**”? code and 
APEC plasma model based on ATOMDB version 2.0.1. The spectral modelling 
approximates the emission from each shell as a single-temperature plasma in col- 
lisional equilibrium and assumes a constant metal abundance of half the solar value™. 

The X-ray mosaic image and its reduced counterpart for the Virgo cluster are 
shown in Extended Data Fig. 2. The 0.5-3.5 keV band was used because it contains 
the dominant fraction of the cluster signal and because of the weak temperature 
dependence of the gas emissivity in this band. The image of relative fluctuations is 
obtained by dividing the mosaic image by a spherically symmetric /-model of the 
mean surface brightness profile taking a core radius of 0.34’ = 1.7 kpc anda slope 
of f = 0.39. Point sources have been excised from the images, using circles scaled 
according to the size of the combined point spread function. Extended Data Fig. 3 
shows the set of annuli in Perseus and Virgo in which this analysis was performed. 
Mean profiles. Deprojected radial profiles of the electron number density n, and 
temperature T, are shown in Extended Data Fig. 1 for both clusters. Note that the 
properties of the two clusters are very different. In particular, the density in Virgo is a 
factor of ~3 (or more) lower than in Perseus at radii beyond ~ 10 kpc. The tempera- 
ture in Virgo is also lower, by a factor of 1.5-2 at r~ 10-20 kpc. Yet Quurb ~ Qeoor in 
both clusters, as shown in Fig. 3, suggesting a self-regulated mechanism such as, for 
example, the AGN feedback model”. 

The mean mass density of the gas is po = (ne + nj)um, = Emp n,, where n, = 
(€ — 1)n, is the ion number density and m, is the proton mass. Consider a fully 
ionized plasma with an abundance of heavy elements 0.5 the solar value, 
€ = 1.912 and a mean particle weight of = 0.61. The cooling time is defined as 

3(ne+n;)kgT 3 € kgT 
a > amA(T) 26-1 mAdT) 
function” (erg cm’s_'), kg is the Boltzmann constant and we assume identical ion 
and electron temperatures: T = T, = T;. The sound speed, treating the ICM as an 


5 kpT 


, where A,(T) is the normalized cooling 


ideal monatomic gas, is c; = 


Both foo and c, are plotted in Extended Data Fig. 1 as functions of radius. It is 
manifest that t,o) is shorter than the Hubble time in the central ~100 kpc. Note 
that the cooling time is at least 7-20 times longer than the characteristic free-fall 
time tg in both clusters, defined in terms of the radius r and the gravitational 
acceleration gas ty = (2r/g)'’”. Therefore, thermal instability is at most marginally 
important for the hot gas**. 

The cooling time tf... and the cooling rate Q...) have been calculated using a 
cooling function /,(T) for gas with solar metallicity. This is a conservative choice, 
because the dependence of the cooling function on metallicity is not strong and 
often can be neglected for typical ICM gas temperatures, ~2 X 10’-10° K. Inaddi- 
tion, cluster-core metallicity measurements from X-ray spectra can be biased owing 
to the complexity of the spectral modelling of multi-temperature plasma. By account- 
ing for radial metallicity variations in both clusters (based on the simplest one- 
temperature spectral model) and the consequent variation of the cooling function, 
the cooling rates shown in Fig. 3 and Extended Data Fig. 4 may be lower by a factor 
of 0.8 in Perseus and in the outermost annuli in Virgo, but higher by a factor of 2 in 
the innermost annuli in Virgo. 

Estimates of velocity and density required for heating-cooling balance. It is 
useful to have a-priori estimates of the fluctuation amplitudes required to make a 
heating—cooling balance plausible. Equating Qcoo1 = Net;A,(T) and Quy = Capo Vik 
the characteristic Mach number of the turbulent motions at scale 1 = 1/k becomes 


Vv Pai AT 
Ma V3 Lk va(z ( ‘) m3 cK V3 


Cs cumpyp Ca 


Ne 1/3 G vs l 4/3 
~0.15(G3 =) (a =r) (re) 


Here we have referred all of the equilibrium quantities to their typical order-of- 
magnitude values and used the fact that the normalized cooling function /,,(T) is 
a weak function of the ICM temperature”, allowing us to adopt the mean value 
An * 2.5 X 10°”? ergcm*s”' (for a gas with solar metallicity). Because bubbles 
have typical sizes of ~5-20 kpc (ref. 8), the value / ~ 10 kpc is a reasonable order- 
of-magnitude estimate of the outer scale for the ICM turbulence driven by such 
bubbles in cluster cores. Thus, the dissipation of turbulence with relatively low 


Mach numbers, Ma ~ 0.15, should be sufficient to balance the cooling of the gas 
in cores. 

In view of the relationship 5p;/po ~ 111 V1,4/cs between the amplitudes of density 
and velocity fluctuations'’, these Mach numbers correspond to 6p/p9 ~ 10%. These 
are indeed typical values of density fluctuations we see in galaxy clusters. 

Trivial part of the correlation between heating and cooling. Because the density 
explicitly enters the expressions for both the cooling rate and the turbulent heating 
rate, the linear correlation between these rates seen in Fig. 3 partly reflects the large 
range of mean densities at different radii (Extended Data Fig. 1). To show that the 
correlation is not due solely to this trivial part, we divide both Q.o) and Qiurp by the 
density py and thus obtain the cooling and heating rates per unit mass (ergs_'g') 
(Extended Data Fig. 4). Although the range of values of both rates is now smaller, as 
expected, the correlation between them remains manifest. 

Systematic uncertainties in the measurement of density fluctuation amplitudes. 
Westart with the measurements of the surface brightness fluctuations based on broad- 
band X-ray images” (LZ. et al., manuscript in preparation), using the A-variance 
method'**”. The variance on scale / estimated using this method corresponds to a 
convolution of the original power spectrum with a broad filter. For a Kolmogorov- 
like power spectrum, the method can overestimate'® the amplitude of fluctuations 
by ~25%. 

A more important source of uncertainty in the determination of the density 
power spectrum is the fact that dividing the cluster image into ‘perturbed’ and 
‘unperturbed’ components is ambiguous, especially for a relatively steep perturba- 
tion spectrum like Kolmogorov’s, whose integrated power is dominated by the 
largest scales*”*. The $-model provides a reasonable description of the radial sur- 
face brightness profiles for Perseus and Virgo. It is therefore a sensible starting choice 
of unperturbed cluster model. Of course, more complicated models, for example 
projection of an ellipsoidal /-model or models with more sophisticated radial 
profiles, could be used as well. Adding more flexibility (more fitting parameters) to 
the model allows one to absorb more large-scale features of the image into the 
model surface brightness distribution. The net result of such improved fitting is 
that the measured power in the remaining perturbations will decrease on large 
scales, whereas the small-scale power will be less affected (provided the spectrum, 
E(k), is not steeper than k~°, which would correspond to the spectral tail of a 
smooth large-scale distribution; indeed, all our measured spectra are close to the 
Kolmogorov k *? spectrum, which satisfies this constraint). This would cause 
the power spectrum to flatten at large scales. This model-dependent nature of the 
large scales is a feature of any division of the surface brightness variations into 
unperturbed and perturbed parts, including the case of the simplest /-model. This 
is why we expect that the estimates of the heating power based on turbulence 
measurements on small scales within the inertial range are probably more robust 
than estimates based on larger, outer scales. Our estimate of ¢ is thus not very sus- 
ceptible to the choice of the underlying model of the mean surface brightness profile. 

The reconstruction of the three-dimensional power spectrum of density fluc- 
tuations P3p from the two-dimensional power spectrum of the surface brightness 
fluctuations Pzp is another source of uncertainty. The geometrical factor fyp_.3p 
= Pxp/P3p depends on the radial profile of the surface brightness”*. We use the 
mean value of f:p_.3p for each annulus and conservatively estimate the uncertainties 
by comparing it with the factors for the inner and outer radii of the same annulus. 
The maximal uncertainty does not exceed 20% except for the innermost region 
of Virgo. 

The random nature of density fluctuations is another source of uncertainty. The 
spectra we calculate are based on squared amplitudes averaged over each annulus. 
Given a (expected) large degree of intermittency of density fluctuations and the 
limited spatial extent of the annuli, one might ask how representative and how sta- 
tistically converged (that is, well sampled) such annular averages are. For example, 
analysing fluctuations in small patches within the 3’-4.5’ (62-94 kpc) annulus in 
Perseus, we find 5p,/po at scales k~’ ~ 15 kpc varying in a relatively broad range 
from 3% to 10%. This difficulty in relating the root mean squared turbulence level 
to what happens (and what is observed) in any given location is unavoidable 
because one always observes only a single realization of the fluctuating field. To 
achieve statistical convergence, we perform our averages in relatively wide annuli. 
The results we report are robust in the sense that choosing twice broader annuli 
does not change the conclusions. 

A related problem is associated with the weighting scheme used to calculate the 
amplitude of the fluctuations within each annulus by averaging an image after 
applying a filter that selects perturbations with a given spatial scale. The exposure 
maps of the images are not uniform and the brightness of the cluster itself also 
varies substantially across each annulus. The optimal weighting scheme for the 
reduction of Poisson noise would require the weights to be wy ~ texplo, where texp is 
the exposure map and Ip is the global $-model profile of the surface brightness. This 
means that those parts of the cluster that have higher numbers of counts would have 
larger weights. We have experimented with two other choices of weights: wz ~ texp 
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and w3 = 1. These weights have larger statistical errors, but provide a more uniform 
scheme for evaluating the amplitudes of the surface brightness fluctuations across 
the image. For the analysis reported in this Letter, we used the uniform weight 
w3 = 1. In most cases (except for the innermost regions of the two clusters), the 
uncertainty associated with the choice of the weights does not exceed 20%. 

The vertical width (‘error bars’) of the spectra shown in Fig. 2 and Extended 
Data Fig. 4 reflects the 1o statistical uncertainty. The uncertainties discussed above 
slightly affect the shape of the spectra and may change the normalization by the 
factors estimated above (I.Z. et al., manuscript in preparation). The dark-shaded 
regions of the spectra in Fig. 2 and Extended Data Fig. 4b show the wavenumber 
ranges over which we deem the spectra to be determined reliably—these ranges 
were used to determine the turbulent cascade rate ¢ in the manner described in the 
main text. The high-k limits of these ranges are set by the ‘statistical’ uncertainty 
(Poisson noise) or by the point spread function distortions of the amplitude (in both 
cases the uncertainty is less than 20% in the ‘reliable’ range). At low k, we limit our 
reliable k ranges by the wavenumbers where the spectra start flattening. The shape 
of the spectra at these scales is most probably determined by the presence of several 
characteristic length scales (for example distance from the cluster centre and scale 
heights) and by the large-scale uncertainties inherent in the choice of the underlying 
model of the unperturbed cluster and in using finite-width annular averaging regions. 
This flattening disappears or shifts to smaller k if thicker annuli are used. 
Systematic uncertainties in the density-velocity amplitude conversion. If the 
perturbations of the intracluster gas are small, one expects a linear relationship 
between the velocity V;,, and density 6;/po spectral amplitudes’*: Pi =n) = 
with 7; ~ 1 set by gravity wave physics. This assumes that the injection scale of the 
turbulence is larger than or comparable to the Ozmidov scale*”—the scale on 
which the turbulent eddy turnover timescale becomes shorter than the buoyancy 
(Brunt-Viaisala) timescale (that is, nonlinear advection becomes more important 
than the buoyancy response). Dimensionally, this scale is lp = N-*”7«"”, where 
N=CVJH is the Brunt-Vaisala frequency (H is the hydrostatic equilibrium scale 
height—we have omitted numerical factors and ignored the distinction between 
entropy, pressure and temperature scale heights) and ¢ = Qiurp/po is the turbulent 
cascade rate. The relationship 17, ~ 1 is inherited from large scales on all scales 
1< Io, where the density becomes a passive scalar’’. 

Assuming that radiative cooling is balanced by turbulent heating, Qrurb = Qeoob 
it is possible to make an a priori estimate of Ig by letting é = Q.oo/Po and using the 
local mean thermodynamic properties of the ICM to calculate Qcoop Po and N. We 
have done this for both clusters, for each of the annuli where we subsequently 
calculated Qrury (Extended Data Fig. 4). In all cases, Jo is within the range of scales 
(in some cases, comparable to the largest scales) over which velocity amplitudes were 
measured and used to calculate Qy,,,, and for which the conclusion that Qutb ~ Qeoot 
was drawn. Therefore, our assumption of 7, = 1 is at least self-consistent. 

This assumption is also restricted to the inertial range, that is, to scales larger than 
any dissipative cut-offs. It is interesting to compare the smallest scales that we are 
probing with the Kolmogorov (dissipative) scale Ix = v*/4/e'/4, where v is the kin- 
ematic viscosity calculated for unmagnetized gas (which is approximately the same 
as the parallel viscosity for a magnetized plasma*’). In all regions considered in this 
work, the Kolmogorov scale is much smaller than the smallest scale used by us for 
the determination of the cascade rate. In the regions shown in Extended Data Fig. 4, 
Ik = 0.5 and 2 kpc (kx + 2. and 0.5 kpc’) in the 1.5’-3’ and 3’-4.5’ annuli in Perseus, 
respectively. In the Virgo cluster, Ix ~ 0.3 and 0.8 kpe (kx ~ 3 and 1.3 kpc’ ') in the 
2'-4' and 4'-6' annuli, respectively. 

Cosmological simulations of galaxy clusters confirm that 17, ~ 1 witha scatter of 
30% (ref. 13). Hydrodynamic simulations with controlled driving of turbulence 
also show that 17, ~ 1, provided that thermal conduction is suppressed". The 30% 
scatter in the value of 17, gives a factor of 0.3-2 uncertainty in the heating rate. 

We conclude that the cumulative uncertainty in the estimated heating rate is a 
factor of ~3. Although this uncertainty is large, the approximate agreement between 
heating and cooling rates is an interesting result, reinforced by the fact that the two 
rates are not only numerically comparable to each other but are also linearly cor- 
related. A more rigorous test will become possible with direct measurements of the 
velocity field by future X-ray observatories. 

Theoretical uncertainties: the nature of ripples and ICM heating theories. Unsharp- 
masking of the Perseus image shows rough concentric rings, or ‘ripples’, in the sur- 
face brightness’. The observed morphology of these features, namely narrow in the 
radial direction and wide in the azimuthal direction, suggests two plausible pos- 
sibilities: concentric sound waves’ or stratified turbulence'*””. In the first case, the 
radial scale of the ripples should be determined by the time variability of the central 
AGN activity (factors include the intervals between outbursts, the excitation of mul- 
tiple sound waves by vortices arising during each bubble inflation episode’®, the 
distance from the centre and the ICM properties). In contrast, in the case of stratified 
turbulence, the radial scale Ar will be determined by the ratio of the characteristic 


LETTER 


scale height H in the atmosphere and the velocity amplitude V: Ar ~ HV/c,. Here 
we assume the second scenario and defer the detailed analysis of the nature of the 
substructure to a future publication. 

Many other models of ICM heating, which could in principle offset radiative 

cooling in cluster cores, have been suggested. They differ widely in their presumed 
primary source of energy and in how this energy is channelled to the ICM. A brief 
and incomplete list of the broad classes into which these models fall is as follows: 
(1) source: thermal energy of the cluster gas; channelling mechanism: conductive 
heat flux to the core*!”; 
(2) source: cluster mergers; channelling mechanism: turbulence 
(3) source: galaxy motions; channelling mechanism: turbulence 
(4) source: central AGN; channelling mechanism: shocks and sound waves 
turbulent dissipation’, turbulent mixing”, cosmic rays**””, radiative heating 
mixing of gas between ICM and the hot content of bubbles’” and so on. 

Given the multiplicity of possible scenarios, a detailed discussion and comparison 
of these models or even a complete list of references is beyond the scope of this 
Letter. We refer the reader the reviews in refs 7, 8 and references therein. The con- 
tent of this Letter is focused on the energy channelling mechanism rather than the 
energy source. Note that, along with turbulent dissipation, turbulent heat conduc- 
tion might also have a role in the cooling-heating balance. It can be shown, how- 
ever, that in cluster cores, and assuming either stratified or isotropic turbulence, its 
contribution cannot be much larger than that of the turbulent heating (A.A.S. et al., 
manuscript in preparation). 
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Extended Data Figure 1 | Thermodynamic properties of the Perseus and 
Virgo clusters. Radial profiles of the deprojected electron number density, 
the electron temperature, the cooling (t,.,) and free-fall (ti) times, and the 
sound speed. Red points: data with 1c error bars; black curves: data 
approximations using smooth functions. The increased temperature scatter 
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in the central few kiloparsecs is associated with the presence of multi- 
temperature plasma in cool cores. A two-temperature fit of high-resolution 
XMM-Newton RGS spectra of the core of Virgo suggests an ambient 
temperature there of ~1.6 keV (ref. 54). The smooth-function approximation 
we have chosen therefore approaches this value. 
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Extended Data Figure 2 | X-ray image of the core of the Virgo cluster. and central jet. White circles indicate ‘arm-like’ structures associated with the 
a, X-ray surface brightness in units of counts per second per pixel in the central AGN’s activity, which have also been excised. We adopt 16.9 Mpc as 
0.5-3.5 keV energy band. b, Relative surface brightness fluctuations. Both the distance to the cluster, implying that an angular size of 1' corresponds 


images are smoothed with a 3’’ Gaussian. Black circles: excised point sources _ to a length scale of 4.91 kpc. 
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Extended Data Figure 3 | Set of the radial annuli used in the analysis of the 1.5’ ~ 31 kpc in Perseus (a) and 2’ ~ 9.8 kpc in Virgo (b). The outermost circles 
Perseus and Virgo clusters. The same as Fig. 1b and Extended Data Fig. 1b —_ are 10.5’ ~ 218 kpc and 8’ ~ 39 kpc in Perseus and Virgo, respectively. 

with white circles indicating the annuli used. The width of each annulus is 
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Extended Data Figure 4 | Turbulent heating per unit density versus rates divided by the mass density of gas in each annulus. b, The same as Fig. 2 


radiative cooling per unit density, and the Ozmidov scale in the Perseus and _ with the Ozmidov scale Io = 1/ko = N *”e"” shown for each annulus (vertical 
Virgo clusters. a, The same as Fig. 3, but with the turbulent heating and cooling _ black lines), estimated using & = Qcooi/Po (assuming that Quurb = Qrooi)- 
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Suppression of cooling by strong magnetic fields in 


white dwarf stars 
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Isolated cool white dwarf stars more often have strong magnetic fields 
than young, hotter white dwarfs'*, which has been a puzzle because 
magnetic fields are expected to decay with time** but a cool surface 
suggests that the star is old. In addition, some white dwarfs with 
strong fields vary in brightness as they rotate”"'°, which has been 
variously attributed to surface brightness inhomogeneities similar 
to sunspots*”, chemical inhomogeneities'*'* and other magneto- 
optical effects'*’’. Here we describe optical observations of the bright- 
ness and magnetic field of the cool white dwarf WD 1953-011 taken 
over about eight years, and the results of an analysis of its surface 
temperature and magnetic field distribution. We find that the mag- 
netic field suppresses atmospheric convection, leading to dark spots 
in the most magnetized areas. We also find that strong fields are suf- 
ficient to suppress convection over the entire surface in cool magnetic 
white dwarfs, which inhibits their cooling evolution relative to weakly 
magnetic and non-magnetic white dwarfs, making them appear youn- 
ger than they truly are. This explains the long-standing mystery of 
why magnetic fields are more common amongst cool white dwarfs, 
and implies that the currently accepted ages of strongly magnetic 
white dwarfs are systematically too young. 

The complex magnetic field and photometric variability of the white 
dwarf star WD 1953-011’*”” have been suggested* ” to result from a dark, 
magnetically generated spot on the star’s surface. Using spectropolari- 
metric and photometric observations acquired by our group*''” from 
2001 to 2009, and by other authors””° we reconstructed the distribution 
of surface magnetic field intensity’’ and established’ a physical relation- 
ship between an intense localized magnetic surface feature and a dark 
(cool) spot'’””. Relationships to sunspots were discussed, but remained 
speculative owing to the fundamental differences between atmospheres 
of white dwarfs and solar-type stars. Among these differences, the most 
remarkable are the high densities resulting from the compact nature of 
white dwarfs, their simple chemical compositions and the presence of 
extremely strong global magnetic fields in some stars”’. 

Here we establish a direct relationship between the magnetic field 
strength and temperature distribution on the surface of WD 1953-011. 
The detected cool/dark spot in this interpretation is merely the coolest 
region of a generally smooth temperature distribution associated with 
the global inhibition of convective energy transfer near the stellar 
surface. We modelled the observable photometric variation assuming 
a dependence of the local temperature T,,. at each point on the stellar 
surface and the local modulus of the magnetic field |H]jo< of the form 
Tioc  |H]|icc ’ where y is an arbitrary constant. The best agreement 
between the observed and predicted fluxes was found for y = 0.059 + 
0.004. As can be seen in Fig. 1, the fit is excellent, which favours the pro- 
posed link between the magnetic field intensity and the local temperature 


of the photosphere. Figure 1 presents a sophisticated synthetic picture 
of a white dwarf obtained from observations, with details of its mag- 
netic field and atmosphere. 

The link between the temperature and the magnetic field strength 
suggests that the most plausible explanation for the observed phenom- 
enon is the effect known to operate in sunspots: the inhibition of con- 
vection by strong magnetic fields”". In the majority of white dwarfs with 
hydrogen atmospheres and temperatures below 12,000-14,000 K, con- 
vection transfers a significant fraction of the total energy flux. Modelling 
the atmosphere of WD 1953-011 using the LLmodels stellar atmosphere 
code” we estimate that convection transfers 70%-95% of the total flux 
in subphotospheric layers. This is a substantial fraction of the total flux 
and the suppression of convection should be capable of producing a tem- 
perature decrease comparable to that which causes sunspots. The power 
form of the adopted function Ti.< « |H|joc "independently supports a 
suppression of convection similar to that occurring in sunspots”, and 
can be understood qualitatively—the stronger the field, the deeper the 
convective inhibition, and the cooler the upper photosphere. Using com- 
putations of the extent of the outer convection zone for hydrogen-rich 
stars”’ we also find that the magnetic field of WD 1953-011 is energet- 
ically sufficient to suppress convection over the entire stellar surface, 
and for some distance into the envelope. This, in turn, argues that the 
temperature-magnetic field relationship is valid everywhere over the 
surface. 

Although the physical mechanism that produces cool, dark regions 
seems to be the same in sunspots and cool white dwarfs, there is a fun- 
damental difference. According to theory™, generation of sunspots requires 
the presence of differential rotation and strong outer convection to trans- 
form a weak global magnetic field into strong-field flux tubes. These 
tubes are unstable and migrate over the stellar surface, and ultimately 
decay after tens to hundreds of days. In contrast to sunspots, the mag- 
netic details and associated temperature/brightness distribution patterns 
in WD 1953-011 are stable and do not change over at least ten years’”. 
The global control of surface convection by magnetic field is a funda- 
mentally new observed effect having two important consequences. 

The first consequence is that the majority of convective magnetic white 
dwarfs (MWDs) should demonstrate stable, periodic photometric vari- 
ability. By analysing known photometrically variable magnetic white 
dwarf stars’°, we conclude that such variability is typical for the majority 
of those that are convective. The relationship between the observed” 
semi-amplitudes of the photometric variability of MWDs and their 
surface magnetic fields suggests the existence of two regimes or branches, 
as shown in Fig. 2. Fits to these branches using two different low-order 
polynomial functions demonstrate their statistically significant differ- 
ence. We interpret the first, decreasing branch as due to the presence of 
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Figure 1 | Modelling results for WD 1953-011. a, Tomographic portraits of 


the star’s surface magnetic field (the field modulus, shown in red shades) and 
temperature distribution (greyscale) in planar projection. b, Tomographic 
portraits of the star’s surface magnetic field (red) and temperature distribution 
(greyscale) at different rotation phases ¢ (spherical coordinates). c, Phase variation 
of brightness in stellar magnitudes (on the vertical axis) with rotation period 
P= 1.441788 days of the V-band magnitude of WD 1953-011. Observations are 
shown as black symbols and their uncertainties'”; the model is shown as red symbols. 
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Figure 2 | Dependence of amplitude of the photometric variability of 
MWDs on their surface magnetic field strength B,. MWDs form two 
sequences of variability: radiative and convective stars. The only exception is 
the very hot magnetic white dwarf WD 1658+441, the variability of which 
remains mysterious’. This young degenerate is still forming, and is located far 
before the crystallization stage; we cannot exclude the possibility that it is a 
binary or has an orbiting planet. The star Feige 7 is a unique MWD, the 
variability of which is due to magnetically separated surface abundances of 
helium and hydrogen’*"*. 


global suppression of convection. For a more detailed interpretation of 
Fig. 2 see the Methods. 

The second consequence is that the cooling evolution of white dwarfs 
with the strongest magnetic fields is slowed. This surprising effect explains 
several mysterious statistical characteristics of MWDs. According to 
the white dwarf cooling theory” the characteristic cooling timescale t 
is defined as t = (L/M) *’”, where L and Mare the star’s luminosity and 
mass. As noted above, in white dwarfs convection typically transfers 
70%-95% of the flux from subphotospheric layers to the surface. There- 
fore, strongly magnetic, convective white dwarfs with sufficiently inhib- 
ited convection should typically have much smaller luminosities, and 
therefore longer cooling timescales than their non-magnetic or weakly 
magnetic twins. Following published studies**”’, in the Methods we 
present theoretical details on convection inhibition and qualitatively 
model the process. We argue that convection is inhibited in all cool white 
dwarfs with magnetic fields from hundreds of kilogauss to tens of mega- 
gauss and higher. As a consequence their luminosities are decreased 
(especially for those with the strongest fields), and their cooling timescales 
are increased. (We note that hydrostatic equilibrium is always preserved 
because white dwarfs are cooling remnants without burning cores.) 

The latter conclusion suggests that magnetic white dwarfs must exhibit 
characteristic features in their temperature—age distributions related 
to these phenomena. Indeed, statistical studies demonstrate that the 
observed'* fraction of strongly magnetic stars among cool white dwarfs 
is higher than among the hot ones. Whether this is so has been histor- 
ically debated*”, but the question has recently been settled by the dis- 
coveries of hundreds of magnetic degenerate stars in the Sloan Digital 
Sky Survey*. In Fig. 3a we show the frequency versus effective temper- 
ature of magnetic white dwarfs from this survey, and from a volume- 
limited sample’. The increase in frequency for magnetic stars in the 
convective region is obvious. Having been firmly established, this effect 
is still not understood. 

We suggest that the magnetic suppression of cooling provides a natural 
explanation of the frequency difference between cool (convective) and 
hot (convection-free) MWDs. At the same time, the frequency differ- 
ence is not found among the weak-field stars with fields” of a few 
kilogauss (kG). For these degenerates, convection is not suppressed by 
the field and the cooling suppression does not occur. The basic con- 
sequence is that the slower cooling evolution of cool MWDs changes 
their distribution relative to hot ones, leading the majority of the strong- 
field stars to occur in the low-temperature regime. Using predictions 
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Figure 3 | Some characteristic features in statistical distributions of MWDs. 
a, Frequency dependence of MWDs on temperatures. Filled circles are 
frequencies from the most recent survey*. Triangles are frequencies from other 
studies'*. Uncertainties are obtained by the Monte-Carlo method and taken 
from the literature’. The solid and dashed lines are computations obtained 
assuming inhibition of convection in only the photosphere (dotted line); in the 
upper subphotospheric layers and photosphere (solid line); and the entire 
convective zone (dashed line). b, Distributions of the relative fraction Nmag/N of 
MWDs sorted by their magnetic field strengths into three groups: convective, 
cool MWDs with temperatures below 10,000 K, convective, hot MWDs 

with temperatures from 10,000 K to 14,000 K, and radiative MWDs. Nmag 
and N are the observed numbers of magnetic white dwarfs and all white dwarfs 
in each of these groups. 


of the LLmodels stellar atmosphere code”, we compute three idealized 
cases of changes in fractional incidence of magnetism due to convection 
inhibition among white dwarfs. The results are presented in Fig. 3a. 
Details of these computations can be found in the Methods. One of these 
predictions (the solid line) exhibits excellent agreement with observa- 
tions, theoretically supporting our findings. 

In fact, the increasing fraction of strongly magnetic white dwarfs 
with age can also be inferred from inspection of published samples*** 
(Fig. 3b). Finally, the effect of slowing the evolution is directly seen in 
Fig. 4, where we plot observed*”° spatial motions of MWDs arranged 
by their surface field strengths into two groups. We find that the cool/ 
convective stars show a statistically significant increase in their space 
velocities and velocity dispersions with field strength, indicative” of 
increasing age with increasing field strength. Unfortunately, a similar 
comparison of the MWD population with the population ofnon-MWDs 
is complicated in several ways, as explained and analysed in the Methods. 
From this analysis we conclude that the coolest white dwarfs with the 
strongest fields are the oldest white dwarfs in the Galaxy. This conclu- 
sion and possible alternative explanations of the statistical phenomena 
under consideration are also discussed in the Methods. 

The arguments considered above support the existence of a magnetic 
suppression of cooling in strongly magnetic, isolated white dwarfs. This 
prompts a revision of our interpretation of the MWD cooling sequence 
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Figure 4 | Absolute values of spatial velocities | V| of MWDs versus their 
surface temperatures. a, The sample of all MWDs with known*” | V). 

b, Analysis of the mean absolute velocities <|V|> and dispersions D of 
MWDs. The difference A<|V|> in the mean velocities between the 

group of cool MWDs with field strengths lower than 5 MG (filled triangles) 
and the group of MWDs with stronger fields (open circles) is 
A<|V|>=27+ 7.36kms_'. b, The sample of all known white dwarfs 
within 25 pc from Earth. See Methods for a detailed explanation of this plot. 


that, in turn, may require tuning of our understanding of the evolution 
of the Galaxy and the Universe. 


Online Content Methods, along with any additional Extended Data display iterns 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Relationships in Fig. 2. We believe that the two relationships (branches in Fig. 2) 
between the observed’® semi-amplitudes of the photometric variability of MWDs 
and their surface magnetic field strengths reveal two different processes. In the first 
branch, variability decreases with field strength; we interpret this to be due to the 
presence of global suppression of convection. At the upper end of the first branch, 
for stars with magnetic fields between several hundred kilogauss and one mega- 
gauss, the photometric variability shows maximum amplitudes. Following this max- 
imum the amplitudes begin to decrease, owing to a saturation of the effect with 
increasing field. Stars with stronger fields may become increasingly variable owing 
to changes in the character of the convection inhibition (see below) and/or due 
to other mechanisms such as opacity dichroism", variable cyclotron absorption'®, 
blanketing effect’” and so on. All together, these mechanisms form the second, 
rising branch in Fig. 2. Two remarkable MWDs in Fig. 2 (WD 1658+441 and 
Feige 7) require a brief discussion. The ultra-massive, very hot star WD 1658+441 
cannot be classified within this study—its variation has an alternative explana- 
tion’®. The hot star Feige 7 is marked by an open circle in Fig. 2 despite the fact that 
this star is convective due to its helium-rich atmosphere’’. Studies'** have shown 
that this unique white dwarf exhibits a non-uniform surface helium abundance 
distribution produced by partially suppressed convection controlled by its very 
strong magnetic field, that is, the brightness variation of Feige 7 is also not caused 
by the mechanism we propose here. 
Details on modelling. In this study, the observed brightness variation of WD 1953- 
011 was modelled numerically by testing an empirical dependence of the local 
surface temperatures T\,, on the local modulus of the magnetic field |H]ioc of the 
form Tig¢ = A|H|joc ” where A and y are arbitrary constants. The magnetic field 
intensities Hj,. at each point of the stellar surface were taken from our previous 
studies’’. The integration and minimization methods used to find optimal values 
of the arbitrary constants are standard, and are also described in the literature''. 
To compute model atmospheres of white dwarfs we used the LLmodels stellar 
atmosphere code”. The code can treat strongly non-solar abundance patterns and 
therefore is applicable to white dwarf stars with pure hydrogen, helium or mixed 
compositions. The Stark-broadened profiles of hydrogen lines were computed using 
tables*' based on the VCS theory’. The occupation probability formalism of Hummer 
and Mihalas* was used in computations of hydrogen level populations in dense 
plasma conditions. The convective energy transport was computed using the CM 
model of convection described by Canuto and Mazzitelli****. A commonly used treat- 
ment of convection based on mixing-length theory” is also provided in the code. 
To calculate changes in fractional incidence of magnetism among white dwarfs 
due to convective inhibition we used the following relationships. In the first-guess 
approximation the fraction F(T) of MWDs in a given interval of temperatures is 
proportional to characteristic times” of the cooling evolution t(T) ~ (L(T))~ 7. 
Computing white dwarf atmospheres using the atmosphere model code described 
above we estimate relative changes in luminosities L(T) taking convective energy 
transport into account and with inhibited convection L;(T) considering different 
possible cases of inhibition. The resultant fractions F(T) of MWDs with inhibited 
convections were then recalculated relative to the observed fractions among radi- 
ative MWDs: 


R(T) = FAL (T)/L(T))-*” 


where F, is the observed fraction of radiative MWDs (about 4%; ref. 4). Three cases 
of inhibition were considered (see Fig. 3a): when convection is fully inhibited in a 
MWD ’s atmosphere (above a Rosseland optical depth, Tauross = 1, short dashed 
line); when convection is inhibited at Tauross = 10 (solid line); and when con- 
vection is fully inhibited (long dashed line). The case when convection is inhibited 
directly under the photosphere (the solid line) reproduces the observations well. 
This result characterizes a ‘mean’ case of convective inhibition resulting from an 
averaged value of magnetic field intensities of all known MWDs. This result also 
suggests that all cool MWDs typically increase their cooling times by a factor of 
two to three. In reality, however, MWDs with different magnetic field intensities 
are expected to reveal different efficiencies of the magnetic inhibition effect: the 
stars with weakest fields should be less affected than stars with stronger fields (see 
also below). This requires us to provide some additional explanation by qualitative 
consideration of the convective inhibition mechanisms and their characteristic 
relationship with magnetic field strength. 

Convective inhibition mechanisms. According to studies”®, magnetic fields pro- 
foundly affect convective energy transfer when the magnetic pressure/energy is 
comparable to the material pressure/energy in some layers of a star’s convective 
zone. In these layers convective motions tend to stochastically twist magnetic field 
lines, causing the field to strongly resist the convective motions independently of 
the field orientation and proportionally to the square of the field modulus”. WD 1953- 
011 is a comparatively weak-field MWD, a good example of a star for which the 


equality of magnetic and gas pressures occurs not very far beneath the photo- 
sphere. This is similar to what is observed in sunspots. Using numerical computa- 
tions of external convective layers” we establish that this example represents MWDs 
with magnetic fields from a few hundred kilogauss to a few megagauss, for tem- 
peratures cooler than 8,000 K (ifhydrogen-rich like WD 1953-011), or cooler than 
16,000 K (if helium-rich). We define this group as weak-field MWDs with typical 
field strengths <5 MG. 

For larger fields (tens of megagauss or more, which are common among the 
strongly magnetic MWDs) the similarity of magnetic and gas pressures is reached 
in very deep layers which correspond to the lower borders of the convective zone in 
cool hydrogen-rich MWDs with temperatures lower than 6,000 K, or for helium- 
rich MWDs with temperatures lower than 14,000 K. Above these layers the mag- 
netic energy is much larger than the gas energy. In this case residual convection is 
present, but is restricted by the magnetic field to occur strictly along magnetic field 
lines'*’*”°. We briefly consider this interesting case in more detail. 

Because convective motions are redirected by magnetic fields along the magnetic 
lines, the efficiency of convective energy transport depends also on the orientation 
of the field. In those parts of the star’s surface where the field is essentially horizontal 
(perpendicular to the gravity vector), vertical convection is reduced by the Lorentz 
force acting across magnetic field lines'*"*. In regions corresponding to the poles of 
a dipole, where the field and gravity vectors are oriented mainly in the same direc- 
tion, convection is inhibited through reduction of the convective instability, with a 
quadratic dependence on the field’’. As a result”’, convective cells are strongly elon- 
gated along magnetic field lines and the associated energy transport is reduced 
depending on the field strength. That is, we may also speak about global inhibition 
of convective energy transport with some efficiency that monotonously and asymp- 
totically grows with field strength. However, the character of the expected pho- 
tometric variability due to convective inhibition in strong-field MWDs may be 
different from the case of the weak-field star WD 1953-011. There are also a few 
intermediate cases of MWDs with comparatively thin convective zones (like Feige 7). 
For such stars inhibition of convective energy transport is expected to be incon- 
sequential in the context of inhibition of cooling. 

In general, accurate values of convective inhibition efficiency can be obtained 
only by three-dimensional magnetohydrodynamical calculations that are outside 
the scope of this study. Instead, the effectiveness of the influence of magnetic fields 
on inhibition of cooling and its dependence on an MWD’s surface field can also be 
tested using observations, as illustrated in Fig. 4 and related explanations in the 
text. As can be seen, the group of cool weak-field MWDs with fields less than 5 MG 
exhibits significantly lower spatial velocities and velocity dispersions than the group 
of cool, strong-field stars. This argues that the cool, weak-field MWDs are younger 
than the strong-field MWDs of the same temperatures. It is also interesting to note 
that the inhibition of convection can be very effective in weak-field MWDs (for 
example, in WD 1953-011), but the effect works close to the surface in compara- 
tively thin zones that makes the effect of magnetic inhibition of cooling less effec- 
tive than in the strong-field MWDs (‘thin inhibition’). In contrast, the strongest 
fields of MWDs entirely control the convective zone that, especially in cool stars, 
creates a ‘thick inhibition’ that slows their cooling evolution substantially, as shown 
in Fig. 4. 

Statistical analysis for Fig. 4b. Finally, we discuss the kinematical properties of 
populations of MWDs and non-MWDs (which are probably white dwarfs with 
very weak fields below the detection limit). This comparison could additionally 
provide a test for the effect of magnetic suppression of cooling in strong-field MWDs. 
In general, this is not a trivial question because in contrast to the more or less homo- 
geneous sample of MWDs originating mainly from intermediate-mass magnetic A/B 
stars*°°’, non-MWDsare produced via a variety of different evolutionary channels***. 
Many of these channels may not produce MWDs. Among these channels the most 
populated is probably low/intermediate-mass white dwarfs originating from non- 
magnetic convective main-sequence stars of spectral classes later than A/B. These 
white dwarfs have essentially different space velocities distributed over a much 
larger range (Fig. 4b). Following the conventional point of view that MWDs are 
the remnants of intermediate-mass magnetic A/B stars**’’ we may assume that 
the most massive non-MWDs of the same masses as MWDs originated from non- 
magnetic A/B stars. A comparison of the kinematical properties of MWDs and 
non-MWDs of the same masses could give an answer to the question of whether 
the magnetic population is older than the non-magnetic one. In Fig. 4b we plot 
those non-MWDs with masses exceeding 0.8 solar masses, which are masses typical 
of M™WDs’ (filled squares in Fig. 4). The mass estimates are taken from the most 
recent survey of white dwarfs with known trigonometric parallaxes*’. As can be 
seen, these white dwarfs are located within the population of MWDs. Unfortu- 
nately, the sample of white dwarfs with known masses is very limited, complicating 
our analysis. Moreover, some of these white dwarfs might be from other evolu- 
tionary channels. In addition, the masses we used have significant uncertainties, 
typically about 0.15 solar masses at the 3a level, and may have some computational 


©2014 Macmillan Publishers Limited. All rights reserved 


biases owing to model simplifications. However, despite these problems, analysis 
of this population at least does not contradict our hypothesis that the strongly 
magnetic MWDs are the oldest white dwarfs of the same temperatures. The mas- 
sive non-MWDs marked by squares in Fig. 4 demonstrate the characteristic velo- 
city <|V|> = 45.6 + 3.6kms | anda dispersion D = 13.0kms° |. Comparison of 
these values with the corresponding values for the population of the strongly mag- 
netic MWDs (<|V|> = 54.6 +5.9kms '; D=21.0kms |, see Fig. 4) addition- 
ally supports (especially comparison of the dispersions D) our arguments that the 
strongly magnetic MWDs are older than the non- MWDs of the same masses and 
temperatures. This, in turn, supports our main finding, according to which cooling 
of strongly magnetic white dwarfs is suppressed by their magnetic fields. On the 
other hand, this analysis serendipitously provides the astronomical community with 
independent support of the idea®” that MWDs are descended from intermediate- 
mass magnetic A/B stars. 

Finally, we briefly comment on some alternative ideas related to the problems 
discussed in this paper. According to the relationship connecting cooling times, masses 
and luminosities (see above), it may be that the surface magnetic-field strengths 
of white dwarfs may depend on their masses (the larger the mass of a MWD, the 
smaller its radius and therefore the stronger its surface field). We may therefore 
expect an alternative relationship connecting masses, field strengths and ages. How- 
ever, examination of this problem by other authors’ as well as our own examination 
of MWDs plotted in Fig. 4 reveals no systematic differences in masses and gravities 
of groups of stars with different magnetic field strengths. 

Other explanations for the statistical features in the distribution of MWDs illus- 
trated in Fig. 3 include an evolutionary increase of surface magnetic fields of MWDs 
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due to an unknown mechanism of magnetic field generation’, or an evolution-induced 
diffusion of strong field from the subphotospheric envelope into the photosphere’”. 
The existence of these mechanisms, however, has not been confirmed theoretically** 
or observationally in later studies**”’. 
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High-efficiency acceleration of charged particle beams at high gradi- 
ents of energy gain per unit length is necessary to achieve an affordable 
and compact high-energy collider. The plasma wakefield accelerator 
is one concept’ * being developed for this purpose. In plasma wake- 
field acceleration, a charge-density wake with high accelerating fields 
is driven by the passage of an ultra-relativistic bunch of charged par- 
ticles (the drive bunch) through a plasma**. Ifa second bunch of rela- 
tivistic electrons (the trailing bunch) with sufficient charge follows 
in the wake of the drive bunch at an appropriate distance, it can be 
efficiently accelerated to high energy. Previous experiments using 
just a single 42-gigaelectronvolt drive bunch have accelerated elec- 
trons with a continuous energy spectrum and a maximum energy of 
up to 85 gigaelectronvolts from the tail of the same bunch in less than 
a metre of plasma’. However, the total charge of these accelerated 
electrons was insufficient to extract a substantial amount of energy 
from the wake. Here we report high-efficiency acceleration of a dis- 
crete trailing bunch of electrons that contains sufficient charge to 
extract a substantial amount of energy from the high-gradient, non- 
linear plasma wakefield accelerator. Specifically, we show the accel- 
eration of about 74 picocoulombs of charge contained in the core of 
the trailing bunch in an accelerating gradient of about 4.4 gigavolts 
per metre. These core particles gain about 1.6 gigaelectronvolts of 
energy per particle, with a final energy spread as low as 0.7 per cent 
(2.0 per cent on average), and an energy-transfer efficiency from the 
wake to the bunch that can exceed 30 per cent (17.7 per cent on aver- 
age). This acceleration of a distinct bunch of electrons containing a 
substantial charge and having a small energy spread with both a high 
accelerating gradient and a high energy-transfer efficiency represents 
a milestone in the development of plasma wakefield acceleration into 
a compact and affordable accelerator technology. 

The experiment reported here is carried out in the three-dimensional, 
nonlinear regime of plasma wakefield acceleration, also known as the blow- 
out regime’. In this regime, a tightly focused and short ultra-relativistic 
electron bunch with a density greater than the plasma density propagates 
through a long column of plasma. o, Sc / @, and a, S1c i Wp are the 
root-mean-square (r.m.s.) transverse and longitudinal sizes of the 
beam, respectively, with «, the plasma frequency. The transverse elec- 
tric field of this drive bunch expels all of the plasma electrons within a 
radius of about 30 jum, as shown in the three-dimensional particle-in- 
cell (QuickPIC””®) simulation depicted in Fig. 1a. The Coulomb field of 
the stationary ions pulls the expelled plasma electrons back towards the 
central axis, which begins the wake oscillation, producing periodic ion 
cavities in the plasma. This wake structure follows the beam trajectory 
with a phase velocity matched to the drive bunch, at nearly the speed 
of light. 

In the simulation, the input plasma and beam parameters are similar 
to those measured in the experiment with a simple scaling of the total 
beam charge (see Methods). The on-axis longitudinal electric field E, of 


the wake, also depicted in Fig. 1a, shows that the bulk of the drive bunch 
is located in a region of positive (forward-directed) electric field, and 
thus loses energy. If the electrons in the rear of such a drive bunch were 
to extend into the negative region of the electric field, they would gain 
energy from the wake. If there were not enough charge in the long tail 
of electrons to have a non-negligible impact on the profile of the steep 
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Figure 1 | Three-dimensional particle-in-cell simulation of beam-driven 
plasma wakefield interaction. a, A slice through the centre of an unloaded 
plasma wake, where x is the dimension transverse to the motion, and € = z — ct 
is the dimension parallel to the motion, E, is the on-axis longitudinal 

electric field (red solid line) and J, is the current of the input beam (blue dotted 
line). b, A plasma wake generated by the same drive bunch as in a when 
loaded by a trailing bunch. The plasma electron density is represented in blue, 
while the beam density is represented in red. The ion density (not shown) is 
uniform. The particle-in-cell code QuickPIC*"” was used to generate this 
simulation of the beam-plasma interaction. 
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wakefield E,, they would end up with a large final energy spread and 
leave behind a substantial amount of unused energy in the wake, as was 
the case in earlier experiments'’. 

The key to achieving a high energy-transfer efficiency with a narrow 
final energy spread is in placing an appropriately shaped trailing bunch 
at precisely the correct distance behind the drive bunch so as to flatten 
the wakefield E, and provide for a uniform energy gain throughout the 
trailing bunch'*”*. Theory and simulations have shown that a trailing 
bunch with a Gaussian distribution and the right charge content can 
lead toa nearly flattened wakefield*”’. Figure 1b shows an approximately 
double-Gaussian (in z) input-charge distribution for both the drive and 
trailing bunches. The wakefield is noticeably altered by the addition of 
the trailing bunch charge, that is, the beam has loaded the wake. Note 
that the initial extent of the trailing bunch extends past the first ion 
cavity period (the region void of electrons), or ‘bucket’ of the wake. This 
leads to loss of charge because some of the electrons in the trailing bunch 
are strongly defocused by the returning plasma electrons at the back of 
the first bucket. 

To reach accelerating gradients on the scale of more than a gigaelec- 
tronvolt per metre, the plasma medium must have a high electron den- 
sity, which results in a correspondingly short plasma wavelength. This 
necessitates a very small separation between the drive and trailing bunches, 
and such a configuration is non-trivial to produce with ultra-relativistic 
electron beams’’. The typical scale of a plasma wakefield acceleration 
wake structure at a plasma density of 5 X 10'°cm * is around 200 jum, 
as can be seen in Fig. 1, and therefore the separation between the charge- 
density peaks of the two bunches must be less than this value. The Facility 
for Advanced Accelerator Experimental Tests (FACET) at the SLAC 
National Accelerator Laboratory, where the experiment was carried out, 
was designed specifically to produce such high-current drive and trail- 
ing bunches with an appropriately small separation. 

In the experiment, the SLAC linear accelerator provides a 20.35-GeV 
electron beam to the FACET experimental area, where it is then manip- 
ulated to form a two-bunch structure (see Methods) just before enter- 
ing the plasma source, which is a 36-cm-long, laser-ionized column of 
lithium vapour with a density of 5 X 10'° cm * contained inside a heat 
pipe oven’”'® (see Methods). The final charge delivered to the plasma 
totals 1.80 nC, with 1.02 nC contained in the drive bunch (o, = 25 Lm, 
Tpeak = 4.9 kA), and 780 pC contained in the broader trailing bunch 
(o, = 47 pum, Ipeak = 2.0 kA). Not all of this charge is coupled into the 
plasma wake, however, owing to the oversized width and length of the 
trailing bunch. After exiting the plasma, the spectrum of the electrons 
was diagnosed using a two-screen imaging magnetic spectrometer (see 
Methods). 

Figure 2 shows examples of electron spectra from three individual 
electron beam shots. The first shot, shown in Fig. 2a, gives the nominal 
spectrum of the two-bunch beam when the plasma source is not present. 
The second shot, shown in Fig. 2b, shows the measured spectrum when 
the plasma source is present and the spectrometer is set to optimally 
image 20.35-GeV electrons. This setting allows the energy loss of the drive 
bunch to be well characterized, while the trailing bunch, which has gained 
energy and has a higher divergence than the decelerated drive bunch, is 
strongly defocused. Of the approximately 1 nC of charge contained in the 
drive bunch entering the plasma, approximately 450 pC + 63 pC (+s.d.) 
appears at energies lower than 20.35 GeV. Figure 2c shows the spectrum 
of another shot similar to that shown in Fig. 2b but with the spectrom- 
eter set to image 22.35 GeV. Because the mean energy of the acceler- 
ated trailing bunch is close to the energy focus, it can readily be seen on 
the detector, whereas now the lowest-energy portion of the drive bunch 
has been strongly defocused, as expected. The centroid of the trailing 
bunch core in this shot has gained about 1.6 GeV of energy. The charge 
contained in the core of the accelerated portion of the spectrum is deter- 
mined by an asymmetric Gaussian fit to the peak of the spectral pro- 
jection for each shot (see Methods). The result of the core fit for the shot 
in Fig. 2c is shown in Fig. 2d as the dashed red line. Of the approximately 
800 pC in the initial trailing bunch, the simulation indicates that only 
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Figure 2 | Energetically dispersed beam profiles. a, The dispersed electron 
beam profile without plasma interaction, where the spectrometer is set to image 
22.35 GeV. Because the beam divergence is small, the entire spectrum of the 
beam is well resolved even at this imaging set point. b and c, The dispersed 
beam profile after the electron bunches have interacted with the plasma, 
where the spectrometer is set to image 20.35 GeV and 22.35 GeV, respectively: 
Efocus in b and c. The left x and top y axes correspond to the actual scale of 
the electron beam recorded at the spectrometer diagnostic screen, while 

the bottom E axis shows the calibrated energy axis along the dispersive 
dimension y. The scaling factors in a—c apply to the colour scale, quantifying 
transverse charge density. d, The spatially integrated spectrum (in x) or the 
linear charge density of the bunches shown in c (solid blue line) along with 
the final spectrum obtained from the simulation depicted in Fig. 1b (solid green 
line in d). The core of the accelerated trailing beam is shown for the data 
(dashed red line). 


about half of that resides in the first bucket of the wake. Of this, roughly 
200 + 33 pC of charge appears above 20.35 GeV at the detection plane 
with about 74 + 18 pC of this contained in the core. The spread in charge 
quoted above refers to a data set containing 92 shots (discussed below). 
The projected energy spectrum (integrated over x) for the shot shown 
in Fig. 2c is presented in Fig. 2d as the blue line. For comparison, the 
final spectrum of the simulated two-bunch plasma wakefield accelera- 
tion interaction shown in Fig. 1b—including transport losses and the 
focusing effects of the imaging spectrometer set to image at 22.35 GeV— 
is also plotted as the green line in Fig. 2d, and shows a good qualitative 
agreement with the experimental spectrum. 

The full data set used in this analysis consists of 92 electron beam shots 
taken with the imaging spectrometer set to 22.35GeV. The projected 
spectral profile for each shot was obtained and the resulting 92 spectra 
are plotted in Fig. 3 as a waterfall plot. The profiles are sorted according 
to the ‘total efficiency’ of energy transfer from the drive bunch to the trail- 
ing bunch via the wake. Here, the total efficiency (shown as the black line 
in Fig. 3) is defined as the net energy gain of all accelerated charge in 
the trailing bunch divided by the net energy loss of the drive bunch (see 
Methods). The mean total efficiency observed in the data set is 29.1% 
with a standard deviation of 8.9% and a maximum value of around 50%. 
For comparison, the total efficiency obtained from the simulation shown 
in Figs 1b and 2d is 49%. The core efficiency is shown in Fig. 3 as the red 
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Figure 3 | Spatially integrated electron beam spectra from the data set. All 
92 shots from the data set are shown with the imaging spectrometer set to image 
22.35 GeV. The colour scale represents the energy spectrum in nC GeV", 
similar to the blue curve of Fig. 2d, but transformed using the known dispersion 
of the spectrometer. The shots are sorted by the total energy-transfer efficiency 
(black line), as calculated using all charge above 20.35 GeV. The core 
energy-transfer efficiency, as calculated using only the charge found in the 
accelerated core of the trailing bunch, is shown as the red line. The black 

(or red) horizontal bar represents the typical systematic uncertainty of +5% 
(or +3%) for the total energy-transfer efficiency (or core energy-transfer 
efficiency). 


curve, which has a mean value of 17.7% with a standard deviation of 
6.3%. It is lower than the total efficiency, as expected, owing to the smaller 
amount of charge contained in the core. The maximum core efficiency 
observed is over 30%. 

We note that here we have measured a high energy-transfer efficiency 
from the drive bunch (via the wake) to the trailing bunch, though the 
drive bunch electrons lost only a small fraction of their total energy in 
our 36-cm-long plasma source. However, since the electrons in both 
bunches are ultra-relativistic, there is no relative motion between the 
bunches and the wake structure (see Fig. 1b and Supplementary Video 1), 
which means that the energy-transfer efficiency remains constant over 
the entire flat-density region of the plasma source (see Methods). There- 
fore, in a sufficiently long plasma source, nearly all of the energy in the 
drive bunch would be transferred to the trailing bunch with the same 
efficiency that we have measured. For the present experimental condi- 
tions, the length of the plasma would need to be extended to approxi- 
mately 4 m in order to fully deplete the available energy in the drive bunch. 

In this work, the average energy gain of the core of the trailing bunch 
is estimated to be 1.6 + 0.1 GeV. For our plasma length of about 36-cm 
full-width at half-maximum (FWHM), this represents an accelerat- 
ing field gradient of 4.4GV m_', which is in good agreement with the 
5GVm ' seen in the simulation (red curve in Fig. 1b). The smallest 
energy spread of the trailing bunch core was 0.7% (compared to the initial 
energy spread of the trailing bunch of about 1%; see Methods), and the 
average energy spread was 2.0%, where the energy spread is defined as 
the r.m.s. E/E for each shot. The simultaneous achievement of high 
gradient, high efficiency, and narrow energy spread demonstrated here 
represents an important advance in the development of a collider based 
on plasma wakefield acceleration”. 

Increasing the peak current of the drive bunch by increasing its charge 
will in turn increase the amount of energy transferred from the drive 
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Figure 4 | Energy-transfer efficiency dependence on wake loading. The total 
efficiency (black circles) and core efficiency (red diamonds) versus the initial 
ratio of the charge in the trailing bunch to that in the drive bunch before 
entering the plasma source. The circles (diamonds) represent the same data that 
make up the black curve (red curve) in Fig. 3. The black (red) vertical bar 
represents the typical systematic uncertainty of +5% (+3%) for the total 
energy-transfer efficiency (core energy-transfer efficiency). In this data, the 
total charge is held constant, and only the ratio of the charge of the trailing 
bunch to the charge of the drive bunch is varied. 


bunch to the wake, thereby increasing the strength of the wakefield. 
This also increases the amount of charge needed in the trailing bunch 
to optimally load the wake and maximize the energy-transfer efficiency. 
We have used the natural spectral jitter of the incoming electron beam 
to measure this correlation between wake loading and energy transfer 
efficiency in our data. 

Figure 4 shows a scatter plot of the shot-by-shot total and core effi- 
ciencies (black circles and red diamonds, respectively) obtained for the 
same data as in Fig. 3 plotted against the trailing-to-drive bunch charge 
ratio. An systematic error bar is shown for the same data point in each 
figure for both the total efficiency and the core efficiency values, where 
the leading source of uncertainty comes from the energy loss estima- 
tion (see Methods). The ratio of the charge of the trailing bunch to the 
charge of the drive bunch is used as a means of quantifying the loading 
of the wake by the trailing bunch normalized to the strength of the wake 
itself, which we here assume scales with the charge of the drive bunch. 
The observed correlation demonstrates the dependence of the energy- 
transfer efficiency on the loading of the wake and shows that a suffi- 
ciently high charge in the trailing bunch is necessary to attain a high 
efficiency. Although the final energy spread of the accelerated core of 
the trailing bunch is small at 2%, the increase from around 1% indicates 
that the shape of the trailing bunch profile inside the wake is not fully 
optimized for loading and flattening the wakefield, as shown in the sim- 
ulation of Fig. 1b. The trailing bunch would need to be shorter in extent 
to more optimally flatten the wakefield. 

In conclusion, high-efficiency acceleration ofa distinct trailing bunch 
of electrons containing a substantial charge and having a small-energy 
spread has been demonstrated in a high-gradient, beam-driven plasma 
wakefield accelerator. These results bring plasma wakefield acceleration 
one step closer to becoming a viable accelerator technology. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Generation of two-bunch beam structure. The SLAC linear accelerator provides 
a 20.35-GeV electron beam with a charge of 3.2 nC to the FACET experimental 
area. To create a two-bunch structure with a peak-to-peak separation of ~135 um, 
a single high-charge bunch with a roughly linear head-to-tail correlated energy spread 
is energetically dispersed and physically segmented in a plane transverse to its direc- 
tion of motion. 

Extended Data Fig. 1 shows a schematic of the FACET experimental area. The 
five dipole bend magnets at the vertices of the ‘W ’-shaped beam line forma chicane, 
which allows for the manipulation of correlations between an electron’s momen- 
tum p, and its longitudinal position z within the bunch. In inset 1, the red-to-blue 
colour transition indicates the initial longitudinal energy correlation generated in 
the linear accelerator upstream of the experimental area. Higher energies (red) are 
at the head of the bunch, and there is no transverse (p_, x) correlation at this point. 
After some propagation beyond the first dipole magnet in the chicane, the beam 
becomes highly dispersed in the transverse plane with a strong linear (p-, x) cor- 
relation (inset 2). At this point, a 305-tm-wide, 16-mm-thick tantalum bar (the 
‘notching device’ in Extended Data Fig. 1a), is inserted into the middle of the ~2-mm- 
wide beam. Electrons that intercept this tantalum bar lose energy through colli- 
sions and are scattered at large angles, causing them to be ejected from the beam 
line by the strong magnets further downstream in the chicane. This process removes 
~40% of the initial charge in the beam (inset 3). At a symmetric location before the 
final dipole of the chicane (inset 4), the (p,, x) correlation of the bunch returns with 
the opposite sign. Here, a compact magnetic wiggler (Extended Data Fig. 1c) yields 
synchrotron X-rays that sweep across a scintillating YAG screen, creating a profile 
of the notched electron spectrum. This is the ‘initial spectrometer’, providing an 
optical replica of the notched electron beam spectrum. Inset 5 shows that after leav- 
ing the chicane the bunch is ‘over-compressed’, that is, the lower energy particles 
are in the front of the now segmented beam, comprising the drive bunch, while the 
higher energy particles are in the rear, comprising the trailing bunch. The longit- 
udinal profile of the two-bunch structure is measured in the time domain with a 
transverse deflecting X-band radio-frequency structure (Extended Data Fig. 1b) that 
streaks the beam onto a profile monitor screen located near the plasma source. 
Electron beam characteristics. The electron beam sent into the plasma source has 
an incoming energy of 20.35 GeV with a FWHM spread of +2%, and a total charge 
of 1.80 nC. The drive bunch contains 1.02 nC, with a r.m.s. length of o, = 25 um, 
and peak current Ipeax = 4.9 kA. The broader trailing bunch contains 780 pC, with 
6, = 47 wm, and Ipeax = 2.0 kA. The trailing bunch also exhibits a correlated energy 
spread of about 1% r.m.s. The peak-to-peak separation of the two bunches is 135 um. 
After exiting the chicane and before entering the plasma source, the beam is focused 
by a series of five quadrupole magnets, labelled the ‘final focus quadrupole magnets’ 
(Extended Data Fig. 1d), down to an r.m.s. transverse spot size of 30 um in the 
middle of the plasma density up-ramp. The normalized emittance of the beam in 
the transverse dimensions is roughly €,,. = 358 mm mrad and €,,, = 35.8 mm mrad. 
Extended Data Fig. 2 shows the longitudinal profile of a typical two-bunch beam 
used in the experiment, as measured by the transverse deflecting radio-frequency 
structure. 

Plasma source. The plasma source used in this experiment is a laser-ionized column 
of lithium vapour contained inside a heat pipe oven’””* with a uniform density of 
5.0 X 10'° cm’ over a 26-cm-long region with 10-cm density ramps on either side, 
giving a FWHM length of approximately 36 cm. The lithium vapour column is 
depicted in Extended Data Fig. 1f. Extended Data Fig. 3 shows the density profile 
of the neutral vapour pressure density deduced from the measured temperature pro- 
file along the oven containing the lithium as well as the simple fit used to describe the 
density profile in our model. The density and length of the lithium vapour is con- 
trolled through the temperature of the oven and the pressure of a room-temperature 
noble gas (argon) serving as a buffer for the lithium at either end of the pipe. A 200- 
fs-long Ti:sapphire laser pulse (Extended Data Fig. 1e) containing 250 mJ of energy 
is focused by a 1.5° axicon lens that produces a zero-order Bessel beam profile through 
the full length of the lithium vapour, which in turn creates a plasma column ~1 mm 
in diameter’*. The laser arrives 100 ps before the arrival of the electron beam, which 
is over an order of magnitude earlier than substantial recombination of the plasma 
is expected to occur. Calculations indicate that the peak intensity of the laser is 
sufficient to ionize a ~20-1m-diameter filament in the buffer gas for up to 30 cm 
upstream of the lithium vapour. Interaction of the beam with this filament may lead 
to some nonlinear focusing of the beam, preventing some of the electrons in the 
incoming beam from cleanly coupling into the lithium plasma and thus preventing 
them from participating in the experiment. 

Electron imaging spectrometer. The spectrum of the electrons exiting the plasma 
is diagnosed with an imaging spectrometer having two diagnostic screens. An imag- 
ing quadrupole doublet, depicted in Extended Data Fig. 1g, is required to capture 
and deliver the electrons over the long distance between the exit of the plasma and the 
two screens. A strong dipole magnet, in Extended Data Fig. 1h, vertically disperses 


the electrons onto the two screens, shown as Extended Data Fig. 1i. A camera view- 
ing one screen records Cherenkov light produced by the beam in a 1.4-cm air gap 
between two silicon wafers, while another camera records the scintillation light 
produced by a phosphor screen after the electron beam has passed through it. The 
imaging condition at the Cherenkov screen for the analysed data shown in Figs 2c, 
3 and 4 is for an energy of 22.35 GeV in both the horizontal and vertical (dispersive) 
plane. The imaging condition at the phosphor screen, located 1 m upstream of the 
Cherenkov screen, is for 22.25 GeV in the horizontal plane, and 21.50 GeV in the 
vertical plane. The combination of these different imaging conditions provided con- 
fidence that the signal observed in the data was not simply the result of an enhance- 
ment introduced by the focusing condition of the beam. 

The spatial resolution of the spectrometer in the energy plane is dominated by 
scattering of the beam in the vacuum-to-air exit window. The r.m.s. multiple scat- 
tering angle due to collisions of the beam particles within the solid exit window is 
calculated to be 143 rad, using standard formulae. The profile monitor is located 
95 cm downstream of the exit window, yielding a contribution to the spatial resolu- 
tion of 135 um. The measured spatial resolution is about 150 um. We define the 
energy resolution as the spatial resolution divided by the dispersion, times the beam 
energy. The dispersion induced by the spectrometer dipole, at the nominal FACET 
beam energy of 20.35 GeV, is 62 mm. This gives an energy resolution of 76 MeV at 
the nominal energy of 20.35 GeV. The energy resolution scales as energy squared, 
yielding an energy resolution of 91 MeV for the highest imaging energy referred to 
in the paper, 22.35 GeV. The total distance from the object plane to the imaging 
plane is 22.6 m, and the imaging properties of the spectrometer shows therefore 
little sensitivity to the position of the object plane on the order of 10 cm or less. For 
example, calculations show that the error in the imaged energy is on the order of 
1% or less for particles with an angle of up to 1 mrad at the object plane and for 
errors of the position of the object plane up to +10 cm. 

The beam size after the spectrometer is imaged with a magnification of 0.5 in the 
energetically dispersed (y) plane, and the spot size in this dimension is thus dom- 
inated by the energy spread. In the non-dispersed (x) plane, the spectrometer mag- 
nifies by a factor of 5.3, so the beam size as it appears on our diagnostic screen and 
as it is plotted in Fig. 2 is larger than it is at the plasma exit plane by this factor. 
Plasma wakefield acceleration simulation. The simulations presented in Fig. 1 
and Fig. 2d use the quasi-static, three-dimensional, particle-in-cell code QuickPIC””. 
The input beam is Gaussian in both transverse and longitudinal dimensions, with 
r.m.s. sizes and emittance values corresponding to those measured for the experi- 
mental beam, as listed in the ‘Electron beam characteristics’ section. The simulated 
beam has zero initial energy spread and the initial charge in the simulation corre- 
sponds to roughly 75% of the charge measured upstream of the plasma source in 
the experiment, as this yields the best final spectrum agreement with the data in 
Fig. 2d. The reduced charge in the simulation is probably compensating for electrons 
in the experimental beam that are unable to couple into the plasma wake owing to 
potential factors not captured in the simulation, such as non-Gaussian tails in the 
charge distribution. The simulated plasma density profile was based on the mea- 
sured plasma source profile: a 26-cm-long flat-top density of 5.0 X 10'® cm~* with 
10-cm-long density ramps on each side, giving a FWHM length of 36 cm. 

For this simulation run, a snapshot of the particles and fields was recorded 120 
times over a 51-cm propagation distance (including transition into and out of the 
plasma density ramps on either end), thus fully resolving the oscillations of the enve- 
lope of the electron bunch which occurs on approximately a centimetre scale. The 
simulation box tracks the beam-plasma interaction in the coordinates x, y, € = z — ct; 
that is, the box moves at light speed, close to the velocity of the bunches, although it 
sees the beam and plasma in the reference frame of the laboratory. The box has 
a size of 601 tm X 601 jum X 481 pm in the two transverse dimensions and the 
longitudinal dimension, respectively. The number of the cells for the simulation 
box is 512 X 512 X 512 (~134 million cells in total). 

The full, 120-frame simulation movie from which Fig. 1b was taken is available 
as Supplementary Movie 1. It depicts the evolution of the two-bunch beam struc- 
ture as it propagates through the plasma. The upper portion of the frame shows the 
charge density of the beam and the plasma wake, and the lower portion of the frame 
shows the evolution of the beam’s energy as a function of the longitudinal position 
inside the plasma wake. 

In the simulation, the drive bunch can be seen to be scalloped because the ion 
channel is not formed instantaneously within the beam. A slice at the front of the 
beam feels no focusing force while later slices feel a progressively stronger force 
(progressively higher betatron oscillation frequency) up to the slice that resides in 
the fully formed ion channel. A new scallop forms after each full betatron oscilla- 
tion for the slices in the ion channel’. The curvature of the scalloping is due to a 
nonlinear dependence of the focusing force during the rise time of the bunch when 
the ion column is still forming. A similar scalloping structure is also observed in the 
experiment, as seen on the energy loss portion of the drive bunch spectrum shown 
in Fig. 2b. 
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Transfer efficiency, energy spread, and core charge estimation. The net energy 
gain for a particular shot is calculated as Egain = ) 0p, >, (Ei — Eo) qi, where E;is the 
calibrated beam energy corresponding to pixel row i on the charge-coupled device 
(CCD) image, Ey = 20.35 GeV is the initial beam energy, and q; is the total amount 
of charge observed in pixel row i. The energy loss may be calculated in the same 
way, though the energy loss is known to be under-represented in the 92 shots ana- 
lysed in Figs 3 and 4 owing to the energy setting of the imaging quadrupoles. Thus, 
we instead use an estimation of the energy loss for each shot that is derived from an 
empirical quadratic correlation (normalized residual is 0.87) between the energy 
loss observed on the Cherenkov detector and the initial drive bunch charge mea- 
sured upstream of the plasma source for a data set taken when the quadrupole 
imaging condition was set to 20.35 GeV (as in Fig. 2b) immediately before the 
primary 92-shot data set. For these conditions, the decelerated drive bunch could 
be well observed on the Cherenkov detector and the net energy loss well quantified. 

To determine the energy gain of the core of the accelerated bunch, a fit was 
performed to the projected spectrum of each shot in the data set. Figure 2c shows a 
dispersed beam image of a single shot with the imaging spectrometer set to image 
22.35 GeV. The spectral projection of this shot, shown in Fig. 2d, indicates a narrow 
peak near 22 GeV, anda more diffuse, continuous distribution of accelerated charge 
with a higher divergence that extends down to the initial beam energy. A fit to the 
projected spectrum above 21 GeV is performed that is the sum of two distinct parts: 
an asymmetric Gaussian to characterize the peak that occurs near 22 GeV, and a 
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half-Gaussian to account for the diffuse, high-divergence charge. The asymmetric 
Gaussian portion of the fit is then used to quantify the properties of the accelerated 
core of the trailing bunch, such as the charge, energy gain, energy spread and energy- 
transfer efficiency. Extended Data Fig. 4 shows the fit that was used for the shot 
displayed in Fig. 2c and d, where the red line indicates the core of the accelerated 
trailing bunch. The same function describing the core is also shown as the dashed 
red line in Fig. 2d. 

The 92 shots analysed in the main paper come from two data sets of 50 shots 
each, taken within minutes of one another, while the imaging spectrometer was set 
to image an energy of 22.35 GeV. Of the total 100 combined shots, 8 were rejected 
as outliers in which the beam-plasma interaction was substantially weaker than in 
the remaining 92 shots. This was quantified by the amount of ‘non-participating 
charge’, that is, the amount of charge found within a +2% energy window about 
the beam’s initial energy of 20.35 GeV (corresponding to the initial beam’s FWHM 
energy spread), and by the energy value above the initial beam energy with the greatest 
charge density, or the ‘peak energy’ of the accelerated electrons. A two-dimensional 
cut on peak energy and non-participating charge is applied to the data that rejects 
the 8 low interaction shots. Five of these shots had the lowest drive-bunch charge 
(and thus the lowest peak current) of the complete data set. A low-peak-current 
drive bunch impedes the ability to form a wake in the blowout regime, leading to a 
weak interaction in the plasma. All eight of the rejected shots may also have had 
large transverse sizes, which can similarly impede the ability to form a strong wake. 
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Extended Data Figure 1 | FACET experimental area schematic. Electron magnets, h, spectrometer dipole magnet, and i, Cherenkov and phosphor 
beam line features: a, beam notching device, b, transverse deflecting structure, screens. Bend dipole magnets in the “W’-shaped chicane are each labelled ‘D’. 
¢, initial spectrometer, d, final-focus quadrupole magnets, e, lithium plasma The arrow beneath the e symbol indicates the electron beam’s direction of 


ionization laser, f, lithium vapour column, g, spectrometer imaging quadrupole _ motion (left to right). 
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Extended Data Figure 2 | Measured longitudinal profile of two-bunch monitor screen, while the colour axis indicates the charge density of the 
beam. Image of a typical two-bunch beam streaked onto a profile monitor transverse profile. The bottom (z) axis shows the streaked dimension (y) with 
screen by the transverse deflecting radio-frequency structure (Extended the appropriate scaling factor applied to give the corresponding longitudinal 


Data Fig. 1b). The drive bunch appears on the right-hand side. Overlaid on the _ coordinate. The right axis shows the linear charge density corresponding to the 
image is the projected longitudinal profile (red line). The left (x) and top (y) —_ projected longitudinal profile. 
axes show the transverse dimensions of the streaked beam on the profile 
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Extended Data Figure 3 | Lithium vapour column density profile. The 
profile of the neutral vapour pressure density of the lithium vapour column 
deduced from the measured temperature profile (temperature versus relative 
distance of insertion of a thermocouple probe) along the heat pipe oven is 
shown as the blue line. The simple fit used to describe the density profile in our 
model is shown as the red line. 
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Extended Data Figure 4 | Fit to accelerated charge. The blue line is the 
spectral projection of the same data shot shown in Fig. 2c and d. The green 
line is a fit to the data using a half-Gaussian tail (cyan line) to account for the 
diffuse, high-angular-divergence accelerated charge plus a full, asymmetric 
Gaussian (red) used to describe the core of the accelerated trailing bunch after 
subtracting the half-Gaussian tail. 
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Solution-processed, high-performance 
light-emitting diodes based on quantum dots 


Xingliang Dai’, Zhenxing Zhang’, Yizheng Jin', Yuan Niu’, Hujia Cao’, Xiaoyong Liang’, Liwei Chen’, Jianpu Wang“ 


& Xiaogang Peng” 


Solution-processed optoelectronic and electronic devices are attrac- 
tive owing to the potential for low-cost fabrication of large-area devices 
and the compatibility with lightweight, flexible plastic substrates. 
Solution-processed light-emitting diodes (LEDs) using conjugated 
polymers or quantum dots as emitters have attracted great interest 
over the past two decades”. However, the overall performance of 
solution-processed LEDs” °—including their efficiency, efficiency 
roll-off at high current densities, turn-on voltage and lifetime under 
operational conditions—remains inferior to that of the best vacuum- 
deposited organic LEDs**. Here we report a solution-processed, 
multilayer quantum-dot-based LED with excellent performance and 
reproducibility. It exhibits colour-saturated deep-red emission, sub- 
bandgap turn-on at 1.7 volts, high external quantum efficiencies of 
up to 20.5 per cent, low efficiency roll-off (up to 15.1 per cent of the 
external quantum efficiency at 100 mA cm”), andalong operational 
lifetime of more than 100,000 hours at 100 cd m~*, making this device 
the best-performing solution-processed red LED so far, comparable 
to state-of-the-art vacuum-deposited organic LEDs” *. This opto- 
electronic performance is achieved by inserting an insulating layer 
between the quantum dot layer and the oxide electron-transport layer 
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to optimize charge balance in the device and preserve the superior 
emissive properties of the quantum dots. We anticipate that our 
results will be a starting point for further research, leading to high- 
performance, all-solution-processed quantum-dot-based LEDs ideal 
for next-generation display and solid-state lighting technologies. 
Quantum dots are solution-processable semiconductor nanocrystals”"" 
that promise size-tunable emission wavelengths, narrow emission line- 
widths, near-unity-photoluminance quantum yield and inherent pho- 
tophysical stability. As inorganic crystalline emission centres, quantum 
dots are expected to be promising candidates to overcome stability prob- 
lems of both polymer LEDs and small-molecule organic LEDS (OLEDs), 
such as drastic efficiency roll-off at high current densities and low opera- 
tional lifetime. To fully exploit the superior properties of quantum dots, a 
number of quantum-dot-based LED (QLED) structures were developed 
and various materials, including small molecules, conjugated polymers 
and inorganic oxides, were explored as charge-transport interlayers*’?°. 
Our device (Fig. 1a, b) consists of multiple layers of, in the following 
order, indium tin oxide (ITO), poly(ethylenedioxythiophene):polysty- 
rene sulphonate (PEDOT:PSS, 35 nm), poly (N,N’ -bis(4-butylphenyl)- 
N,N'-bis(phenyl)-benzidine) (poly-TPD, 30 nm), poly(9-vinlycarbazole) 


Figure 1 | Multilayer QLED device. a, Device 
structure. b, Cross-sectional transmission electron 
microscopy image showing the multiple layers of 
material with distinct contrast. Scale bar, 100 nm. 
The PMMA layer is evident only when the cross- 
sectional sample is sufficiently thin (d) because the 
neighbouring quantum dot layer and the ZnO layer 
can obstruct the imaging of the PMMA layer. HTL, 
hole-transport interlayer. c, Flat-band energy level 
diagram. d, High-magnification transmission 
electron microscopy image of an extremely thin 
cross-sectional sample revealing the presence of the 
PMMA layer between the ZnO layer and the 
quantum dot layer. Scale bar, 5 nm. 
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(PVK, 5 nm), CdSe-CdS core-shell quantum dots (QDs, 40 nm), poly 
(methyl methacrylate) (PMMA, 6 nm), ZnO nanoparticles (150 nm) and 
silver (Ag, 100 nm). Figure 1c shows a schematic of the flat-band energy 
level diagram of the layers. The energy level values for ITO, PEDOT:PSS 
and ZnO were obtained by ultraviolet photoelectron spectroscopy and 
optical measurements. Other energy level values were taken from refs 3, 
21-23. Phase-pure zinc blende CdSe—CdS core-shell nanocrystals with 
ten monolayers of CdS shell are used. These quantum dots possess 
a photoluminance quantum yield of >90% and outstanding optical 
properties** (Extended Data Fig. 1). Bilayer-structured hole-transport 
interlayers of poly-TPD/PVK take advantage of the deep highest-occupied- 
molecular-orbit energy level of PVK to realize efficient hole injection 
into the quantum dot layers and the relatively high hole mobility of poly- 
TPD to achieve low turn-on voltage and high power efficiency. Thin films 
of colloidal ZnO nanocrystals (Extended Data Fig. 2) are employed as 
electron-transport interlayers (ETLs) because of their unique combina- 
tion of high electron mobility, ease of preparation and the previously 
identified benefit of efficient electron injection into the quantum dot 
layers'”'*®. The key component of this device, a thin insulating PMMA 
layer (Fig. 1d), is inserted between the ZnO ETL and the quantum dot 
emissive layer. 

The normalized electroluminescence spectrum of the QLED is shown 
in Fig. 2a. The symmetric emission peak at 640 nm with a narrow full- 
width at half-maximum of 28 nm corresponds to Commission Inter- 
nationale de l’Eclairage (CIE) colour coordinates of (0.71, 0.29), which 
are close to the spectral locus and represent colour-saturated deep-red 
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Figure 2 | Device performance. a, Electroluminescence spectrum at an 
applied voltage of 3 V and, inset, a photograph of a device with the Zhejiang 
University logo. a.u., arbitrary units. b, The corresponding CIE coordinates. 
c, Current density and luminance versus driving voltage characteristics for the 
device with the best efficiency. d, EQE versus current density and luminance 
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emission ideal for display applications (Fig. 2b). Figure 2c shows the 
current density—voltage and luminance-voltage characteristics ofa device 
with the best efficiency. The current density and luminance increase 
steeply once the voltage reaches ~1.7 V, yielding a maximum bright- 
ness of over 42,000 cd m™ at 8 V. The peak external quantum efficiency 
(EQE), 20.5%, is achieved at a current density of ~7 mA cm “anda 
brightness of ~ 1,200 cd m ~.The peak EQE of this device, 20.5%, is the 
highest value for QLEDs*!*. High EQE can be maintained in a wide 
range of current densities (Fig. 2d), that is, EQE > 18% when the current 
density is in the range of 1-42 mA cm *, which corresponds to a bright- 
ness in the range of 100-6,600 cd m~*. When the current density reaches 
100 mA cm *, an EQE of >15% is sustained. The low efficiency roll-off 
of this device, which is better than that of the other high-efficiency QLED 
(peak EQE, 18.5%) with vacuum-deposited hole-transport interlayers* 
and is comparable to those of state-of-the-art vacuum-deposited OLEDs'*, 
suggests that our QLEDs are promising for high-power applications. 
The established solution-processing protocol leads to devices with excel- 
lent reproducibility. As shown by the histograms for 27 devices from 
four batches (Fig. 2e), both the high average peak EQE, 18.7%, and the 
low relative standard deviation of peak EQE, 4.3%, are encouraging. 
Our QLEDs, simply sealed by ultraviolet-curable resin and without 
other complicated encapsulation techniques, exhibit outstanding ambi- 
ent stability under high-brightness conditions. As shown in Fig. 2f, fora 
typical device tested at a constant driving current density of 100 mA cm *, 
which corresponds to an initial luminance, Lo, of 10,600 cd m 7, the half- 
lifetime, T;9, defined as the time for the luminance to decrease to L/2, 
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for the device with the best efficiency. e, Histogram of peak EQEs measured 
from 27 devices. f, Stability data for a QLED device (L, luminance). The device 
was test at ambient conditions (temperature, 20-25 °C; relative humidity, 
50-70%). 
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is 95h. By using the relation Lo” Tso = const. and assuming an accel- 
eration factor of n = 1.5 (ref. 25), Tso for this device at 100 cdm * is 
predicted to be over 100,000 h. The remarkable operational stability of 
our device, along with its outstanding efficiency, low efficiency roll-off, 
sub-bandgap turn-on voltage and excellent reproducibility, marks a 
milestone in the production of QLEDs for practical applications. 

For solution-processed LEDs, layer-by-layer deposition of high-quality 
films without intermixing is essential for constructing high-performance 
devices. In our QLEDs, six layers, including PEDOT:PSS, poly-TPD, 
PVK, quantum dots, PMMA and ZnO, are deposited from solution. 
The top Ag electrodes are fabricated by vacuum deposition. Atomic 
force microscopy and scanning Kelvin probe microscopy analyses show 
that the six solution-processed layers have pin-hole-free features (Extended 
Data Fig. 3) and evenly distributed surface potentials (within +40 meV; 
Extended Data Fig. 4). Confocal microscopy imaging reveals that the 
quantum dot layers have homogeneously emissive properties. Speci- 
fically, optical measurements (Extended Data Fig. 5) indicate an aver- 
age thickness of ~6 nm for the PMMA layer in the QLEDs. The root 
mean squared roughness of the quantum dot film is in the range of 
1.6-2.6 nm. After depositing the PMMA layer, the root mean squared 
roughness decreases to 0.6-1.6 nm. On the basis of the above facts, we 
suggest that the PMMA layer on the quantum dot film is continuous, 
and that the local thickness of the PMMA layer fluctuates owing to the 
relatively rough quantum dot surface (Fig. 1d). 

We highlight that the incorporation of the insulating PMMA layer 
with a suitable thickness between the ZnO ETL and the quantum dot 
emissive layer optimizes charge balance in the device. In our device, 
there is a moderate energetic barrier for hole injection owing to the deep 
valance-band energy level of the quantum dots (Fig. 1c). Furthermore, 
the hole mobility of poly-TPD”! (1 X 10-*cm? V~'s~') and PVK” 
(2.5X 10 °cm?V ‘'s ‘)areoneto three orders of magnitude lower than 
the electron mobility of ZnO nanocrystal films (~1.8 X 10° cm? V's‘) 
(Extended Data Fig. 2b). These factors can lead to excess electron injec- 
tion into the quantum dot emissive layer. The unbalanced charge injec- 
tion in the QLEDs is reflected by the significantly higher current densities 
of the electron-only devices (ITO/Al/QDs/ZnO/Al) than those of the 
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Figure 3 | Impacts of the 6nm PMMA layer. a, Current density and 
luminance versus voltage characteristics for QLEDs without and with the 6nm 
PMMA layer. b, Stability data fora QLED without the PMMA layer. c, Current 
density—voltage curves for the electron-only devices showing that the 6 nm 
PMMA layer results in a ~ 1.8-fold decrease in current density. The thicknesses 
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hole-only devices (ITO/PEDOT:PSS/poly-TPD/PVK/QDs/Pd) (Extended 
Data Fig. 6). 

The excess electron current in the QLED without the PMMA layer 
degrades device performance. As shown in Fig. 3a, for the QLED without 
the PMMA layer, both the turn-on voltage and the brightness in the 
entire range of forward bias are nearly the same as for the device with 
the 6 nm PMMA layer, whereas the current densities are much greater. 
Therefore, the excess electron current substantially lowers the efficiency 
of the device. Furthermore, the QLEDs without the PMMA layers exhibit 
relatively poor stability. As shown in Fig. 3b, when tested at a constant 
driving current density of 100 mA cm *, Ts for this device with an Ly of 
6,600 cd m * is only 10h. Converting to the values at the same initial 
brightness of 100 cd m ’, this is only ~5% of that for the device with 
the 6nm PMMA layer. Therefore without efficient radiative recombi- 
nation to release the energy, the excess electron current can deteriorate 
the QLEDs rapidly under operational conditions. 

It is possible to modulate the electron injection from the ZnO ETLs 
to the quantum dot layers and eliminate excess electron currents in the 
QLEDs by adjusting the thickness of the inserted PMMA layers. When 
a PMMA layer is applied in the electron-only devices (ITO/Al/QDs/ 
PMMA/ZnO/A]; Fig. 3c and Extended Data Fig. 7a), the current den- 
sity gradually reduces as the thickness of the PMMA layer increases. 

For the working devices, a gradual decrease in current density on the 
increase of the thickness of the PMMA layers is also observed (Fig. 3a 
and Extended Data Fig. 7b). For the device with the 6 nm PMMA layer, 
a peak EQE of 20.5% (Fig. 2d), corresponding to a close-to-unity internal 
quantum efficiency (estimated to be 88.6% by assuming a Lambertian 
emission profile”*), indicates almost perfect charge balance in this device. 
This is consistent with the low efficiency roll-off and improved opera- 
tional stability of the QLEDs with the 6 nm PMMA layers (Fig. 2d, f). 
Further increasing the thickness of the PMMA layers to >6 nm results 
in both increase of the turn-on voltage and decease of the brightness 
(Extended Data Fig. 7c, d). These control experiments, along with the 
results described above, clearly suggest that charge balance in the QLEDs 
can be optimized by inserting an insulating layer with a suitable thick- 
ness. Either excess electron injection or over-blocking electron current 
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of the ZnO layer and the quantum dot layer are 150 and 40 nm, respectively. 
d, Time-resolved photoluminance decay for the quantum dot films contacting 
different layers. The thicknesses of the layers are identical to those in the 
optimized QLED. 
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deteriorates charge balance in the QLEDs and thereby degrades device 
performance. 

For both photoluminance” and electroluminescence”’, charging 
degrades the emissive properties of the quantum dots. In our case, when 
the quantum dots are in direct contact with the ZnO ETLs, a spontane- 
ous charge transfer process occurs owing to the work function difference, 
leaving positively charged quantum dots’. Charging of the quantum dots 
causes inefficient trion emissions”, as indicated by the fact that the aver- 
age photoluminance lifetime of the quantum dot films decreased from 
21.6 to 10.6 ns after the deposition of the top ZnO films (Fig. 3d). The 
insertion ofa thin PMMA layer modifies the QD/Zn0O interfacial inter- 
action, increasing the lifetime of the quantum dot film to 19.5 ns (Fig. 3d). 
These results indicate that the PMMA layers help to maintain charge 
neutrality of quantum dot emitters and preserve their superior emis- 
sive properties. 

Finally, the strategy of inserting insulating layers between oxide ETLs 
and quantum dot layers can be extended to improve hybrid QLEDs with 
other types of quantum dot emitters. Wurtzite-structured CdSe-CdS 
core-shell nanocrystals with four monolayers of CdS shell’* are used as 
an example. When these thin-shell quantum dots are used as emitters, 
a device structure of ITO/PEDOT:PSS/poly-TPD/PVK/QDs/ZnO/Ag 
yields a peak EQE of 2.5%. When a 6nm PMMA layer is inserted, the 
peak EQE of the QLEDs increases to 4.7% (Extended Data Fig. 8). 

The present work demonstrates the best-performing solution-processed 
red LEDs with colour-saturated emission, record efficiency, low efficiency 
roll-off, sub-bandgap turn-on voltage, excellent reproducibility and out- 
standing operational stability, whose overall performance is comparable 
to state-of-the-art OLEDs produced by vacuum deposition® * (Extended 
Data Table 1). Such outstanding optoelectronic performance is achieved 
by introducing the conceptually new device structure resulting from 
the insertion of an insulating layer between the quantum dot layer and 
the oxide ETL, and by using quantum dots with superior properties as 
solution-processed inorganic emissive centres. We believe that there are 
no fundamental obstacles to extending these techniques to differently 
coloured QLEDs, which would lead to low-cost, large-area, high-efficiency, 
high-colour-quality, stable, all-solution-processed electroluminescent 
devices for both display and solid-state lighting technologies. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Materials. PVK (average molecular weight, 25,000-50,000 g mol — 1 PMMA (aver- 
age molecular weight, ~ 120,000 g mol” ') and zinc acetate dehydrate (>98%) were 
purchased from Sigma Aldrich. Tetramethylammonium hydroxide (TMAH, 98%), 
1-dodecanethiol (98%), dodecane (99%), octylamine (98%), cadmium oxide (CdO, 
99.998%), 1-octadecene (ODE, 90%) and oleic acid (HOI, 90%) were purchased from 
Alfa-Aesar. Sodium diethyldithiocarbamate trihydrate (NaDDTC:3H,0, 99%) was 
purchased from Aladdin Reagents. Cadmium acetate dihydrate (Cd(Ac)2*2H20, 
98.5%) was purchased from Shanghai Tingxin Reagents. Chlorobenzene (extra dry, 
99.8%), m-xylene (extra dry, 99%), octane (extra dry, >99%), ethanol (extra dry, 
99.5%), 2-ethanolamine (99%) and oleylamine (80—90%) were purchased from 
Acros. Dimethyl sulphoxide (DMSO, HPLC grade) and ethyl acetate (HPLC grade) 
were purchased from J&K Chemical Ltd. Acetone was purchased from Sinopharm 
Chemical Reagents. Poly-TPD was purchased from American Dye Source. Patterned 
ITO-glass substrates (sheet resistance, 15Qsq_') were purchased from Xiamen 
Weihua company. All materials were used as received. 

Synthesis of CdSe-CdS core-shell quantum dots. The CdSe-CdS core-shell quan- 
tum dots (ten monolayers of CdS shell) with phase-pure zinc blende structure were 
synthesized according to ref. 28 with some modifications. Briefly, the zinc blende 
CdSe cores (3.1 nm) were synthesized and purified according to our recent report”. 
For the shell coating, dodecane (3.8 ml), octylamine (1.05 ml), oleylamine (0.45 ml) 
and purified CdSe core solution containing 2 X 107” mol of nanocrystals were mixed 
and heated to 80 °C under an argon flow. Reaction cycles, that is, addition of the 
Cd(DDTC)>-amine precursor solutions at 80 °C and growth of CdS monolayers at 
150°C for ~20 min, were performed for the growth of the first six monolayers. 
Desirable amounts of Cd(DDTC)>-amine solutions, that is, 0.08, 0.12, 0.16, 0.21, 
0.26 and 0.32 ml, were used for the growth of first, second, third, fourth, fifth and 
sixth monolayers, respectively. For the seventh to tenth monolayers of CdS, the pre- 
cursor solution was changed to 50 mol% of Cd(DDTC), and 50 mol% of Cd(Ol)2, 
and the growth temperature was set at 160 °C. Quantities 0.39, 0.47, 0.55, and 0.64 ml 
of the precursor solutions were used for the growth of seventh, eighth, ninth and 
tenth monolayers of CdS, respectively. The resulting CdSe-CdS core-shell nano- 
crystals were purified and subjected to a ligand exchange procedure. The original 
ligands were replaced by 1-dodecanethiol. The ligand-exchanged quantum dots 
were dispersed in octane and filtered before use. 

Colloidal ZnO nanocrystals. Colloidal ZnO nanocrystals were synthesized by a 
low-temperature solution-precipitation method” with some modifications. A DMSO 
solution (30 ml) of zinc acetate hydrate (3 mmol) was mixed with an ethanol solu- 
tion (10 ml) of TMAH (5.5 mmol) and stirred for 24h under ambient conditions. 
Then the ZnO nanocrystals were precipitated by adding ethyl acetate and redis- 
persed in ethanol. Additional ligands of 2-ethanolamine (160 ll) was introduced to 
stabilize the nanoparticles. The ZnO nanocrystals were further washed with ethyl 
acetate and redispersed in ethanol. The solutions were filtered before use. 
Device fabrication. PEDOT:PSS solutions (Baytron P VP Al 4083, filtered through 
a 0.45 um N66 filter) were spin-coated onto the ITO-coated glass substrates at 
4,000 r.p.m. for 60s and baked at 140 °C for 10 min. The PEDOT:PSS-coated sub- 
strates were transferred into a nitrogen-filled glove box (O2 < 1 p.p.m., HO < 1 p.p.m.). 
Poly-TPD (in chlorobenzene, 8 mg ml), PVK (in m-xylene, 1.5mgm_ 1. quan- 
tum dots (in octane, 15 mg ml ~ 1), PMMA (inacetone, 1.8 mg ml ') and ZnO nano- 
crystals (in ethanol, 50 mg ml ') were deposited layer by layer by spin coating at 
2,000 r.p.m. for 45 s. The poly-TPD and PVK layers were baked at 110 °C for 20 min 
and at 170 °C for 30 min, respectively, before the deposition of the next layer. Finally, 
Ag electrodes (100 nm) were deposited using a thermal evaporation system through 
a shadow mask under a high vacuum of ~6 X 10’ torr. The device area was 4 mm” 
as defined by the overlapping area of the ITO and Ag electrodes. The devices were 
encapsulated in the glove-box by the cover glasses using ultraviolet-curable resin. 


Characterizations. We used a Keithley 2400 electrometer for current density-voltage 
characterizations anda fibre integration sphere (FOIS-1) coupled with a QE-65000 
spectrometer for light output measurements (Extended Data Fig. 9). Note that the 
ITO glass substrates are in close contact with the input port of the integration sphere 
(but are not inserted into the integration sphere). The area of the QLED device 
(4mm7’) is much smaller than that of the input port (9.5 mm in diameter) so that 
the coupling factor for the photons emitted into the forward viewing directions” 
(from the QLED to the integration sphere) is unity. The electroluminescence char- 
acteristics of the QLEDs were cross-checked using a system comprising a photometer 
(Spectra Scan PR655) coupled with a computer-controlled Keithley 2400 electro- 
meter in L. Liao’s laboratory. The half-lifetime Ts) and the driving voltages are 
measured at the same time using an ageing system made by Shanghai University. 

The absorption spectra of the nanocrystals were measured using a Shimadzu 
UV 3600 spectrophotometer. The photoluminance spectra of the quantum dots were 
obtained by using an Edinburgh Instruments FLS920 spectrometer. The absolute 
photoluminance quantum yield of the quantum dot solution was measured using 
an Ocean Optics FOIS-1 integrating sphere coupled with a QE65000 spectrometer. 
The time-resolved fluorescence spectra of the quantum dot films were measured by 
the time-correlated single-photon counting method using an Edinburgh Instru- 
ments FLS920 fluorescence spectrometer. The samples were excited by a 405 nm 
pulsed diode laser (EPL-405). An Olympus confocal laser scanning microscope 
(FV1000) equipped with an inverted fluorescence microscope (IX81) was used to 
evaluate the emissive features of the quantum dot films. A 488 nm laser was used to 
excite the sample, and the signals in the wavelength range of 580-680 nm were col- 
lected for imaging. Fourier transform infrared spectroscopy (FTIR) spectra were 
obtained using a Bruker Vector 27 spectrophotometer. 

Transmission electron microscope (TEM) analyses on the cross-sections of the 
QLEDs were carried out using a Tecnai G2 F20 microscope. The cross-sectional 
samples were prepared by using focused-ion-beam equipment (Quata 3D FEG). 
Atomic force microscopy (AFM) measurements were conducted on either a Park 
XE-120 atomic force microscope or an Agilent 5500 AFM (Agilent Technologies) 
using silicon AFM tips (HQ:NSC18 and HQ:NSC15, Mikromasch) or high-resolution 
probes (Hi’RES-C19/AIBS). Scanning Kelvin probe microscopy measurements were 
performed on a Park XE-120 atomic force microscope using Cr-Au-coated con- 
ducting AFM tips (HQ:NSC19, Mikromasch). The thicknesses of the multilayers 
were measured using a Dektak 150 stylus profilometer. 

The transmission electron microscopy images of the quantum dots were made 
on a Hitachi 7700 TEM operated at 80 keV. The X-ray diffraction measurements 
were conducted using a Rigaku Ultimate-IV system operated at 40 kV and 40 mA 
using the Cu Ke line (A = 1.5418 A). 
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Extended Data Figure 1 | CdSe-CdS core-shell quantum dots with ten monolayers of CdS shell. a, A typical TEM image. Scale bar, 20 nm. b, X-ray diffraction 
profile. c, Ultraviolet-visible absorption and photoluminance spectra. 
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Extended Data Figure 2 | ZnO nanocrystals as ETLs. a, Ultraviolet-visible  space-charge-limited-current region (J x V’) with Child’s law, 
absorption spectrum and a typical TEM image (inset: scale bar, 50nm) of the J=(9/8)é£0/t. Vv / d°, where & , Es H. and d are the vacuum permittivity, 
colloidal ZnO nanocrystals. b, Current density—voltage (J-V) characteristics relative permittivity, electron mobility and film thickness, respectively’’. 
of an electron-only device (ITO/AI/ZnO/Al). The thickness of the ZnO layer By assuming that ¢, = 4, 1, is determined to be 1.8 10 *cm*V_'s 1. 
is 300 nm. The electron mobility of the ZnO film is obtained by fitting 
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Extended Data Figure 3 | AFM characterizations of the multilayers of to the tip-to-tip and sample-to-sample variations. Extensive AFM 
PEDOT:PSS, poly-TPD, PVK, quantum dots, PMMA and ZnO films in the | measurements show that the root mean squared roughnesses for the quantum 
device configuration, respectively. For each layer, the height image, the dot layer and the PMMA layer are in the ranges of 1.6-2.6 nm and 0.6-1.6 nm, 


line-scan profile, the pseudo-three-dimensional image and the phase image are _ respectively. 
shown. Note that the surface root mean squared values may change owing 
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Extended Data Figure 4 | Scanning Kelvin probe microscopy characterizations of the multilayers of PEDOT:PSS, poly-TPD, PVK, quantum dots, PMMA 
and ZnO films in the device configuration. Note that the data have been linearly fitted to show the spatial uniformity. 
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Extended Data Figure 5 | FTIR analyses to determine the average thickness 
of the PMMA layers used in the QLEDs. The sample for FTIR measurements 
was produced by layer-by-layer spin-coating the PVK (in m-xylene, 

1.5 mg ml’), quantum dots (in octane, 15 mg ml!) and PMMA (in acetone, 
1.8 mg ml!) at 2,000 r.p.m. onto the cleaned CaF, substrates. We assume 
that the average thickness of the PMMA layer in this sample is identical to 
that of the PMMA layer in the optimized QLEDs. The absorption of the 
carbonylic groups of this sample was measured multiple times, averaged 

and compared with that of a 90 nm PMMA film (determined by stylus 
profilometer). Given that the absorbance of the carbonylic groups is in the 
dynamic range of the instrument, the thickness of the PMMA layer deposited 
onto the quantum dot film was determined to be ~6 nm. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


LETTER 


—*— |TO/PEDOT:PSS/HTLs/QDs/Pd 
—®*— |TO/PEDOT:PSS/HTLs/QDs(20nm)/Pd 
—4— Al/QDs/ZnO/AI 


Current density (mA cm’) 


1 0 


10 10 
Voltage (V) 


Extended Data Figure 6 | Electrical measurements on the electron-only 
devices (ITO/Al/QDs/ZnO/Al) and the hole-only devices (ITO/ 
PEDOT:PSS/poly-TPD/PVK/QDs/Pd). The current density of the electron- 
only device (ITO/Al/QDs/ZnO/Al) is more than one order of magnitude 
greater than that of the hole-only device (ITO/PEDOT:PSS/poly-TPD/PVK/ 
QDs/Pd). In the above two devices, the thicknesses of all layers are identical 
to those used in the QLEDs. For the quantum dot layer, the thickness is 

~40 nm. We note that for quasi-type-II CdSe-CdS quantum dots, the electron 
wavefunction extends to the shell region, whereas the hole wavefunction 
remains confined to the CdSe core, leading to greater electron mobility than 
hole mobility. We presume that the recombination zone is close to the 
PVK/QDs interface due to the very low hole mobility of the quantum dot films. 
Therefore we also fabricated a hole-only device with a quantum dot layer of 
~20 nm. The results show that the current density of the hole-only device with 
the 20 nm quantum dot layer is still much smaller than that of the electron-only 
device with the 40 nm quantum dot layer. 
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Extended Data Figure 7 | Impact of the thickness of the PMMA layer onthe — of peak EQEs and turn-on voltages (V;;,) of the QLEDs on the thicknesses of 
QLED performance. a, Current density—applied bias curves for the electron- the PMMA layers. The PMMA layers with thicknesses of 5, 6, 7, 8 and 

only devices (ITO/Al/QDs/PMMA/ZnO/A]). b, ¢, Current density—driving 10 nm were deposited from acetone solutions with concentrations of 1.5, 1.8, 
voltage (b) and luminance-driving voltage (c) curves for the QLEDs 2.1, 2.4 and 3.0 mg ml‘, respectively. 
(ITO/PEDOT:PSS/poly-TPD/PVK/QDs/PMMA/ZnO/Ag). d, Dependence 
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Extended Data Figure 8 | QLEDs with four-monolayer-shell CdSe-CdS of the PEDOT:PSS, poly-TPD, PVK, quantum dot, PMMA and ZnO layers are 


quantum dots as emitters. a, Current density and luminance versus driving 35, 30, 5, 30, 6 and 150 nm, respectively. 
voltage characteristics. b, Curves of EQE versus driving voltage. The thicknesses 
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Extended Data Figure 9 | QLED characterization system. a, A Keithley 
2400 electrometer is used to obtain current density—voltage characteristics. 

A fibre integration sphere (FOIS-1) coupled with a QE-65000 spectrometer is 
used for light output measurements. b, Top-view and side-view of the QLEDs 
in direct contact with the input port of the fibre integration sphere. The 

glass substrate (19 mm X 19 mm) is rested on top of (but not inserted into) 
the integration sphere. The area of the QLED device (4 mm’) is much smaller 
than that of the input port (9.5 mm in diameter) of the integration sphere so 
that the coupling factor for the photons emitted into the forward viewing 
directions (from the QLED to the integration sphere) is unity. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


LETTER 


Extended Data Table 1 | Comparison of our device with other high-performance red LEDs 


Device’ Peak EQE EQE(%) @ tWall-plug E,,(%) Vin CIE ‘T,, @ 
(%) 100 mA cm? @ 100 mA cm? (V) coordinates 100 cd m? 

Our QLED 20.5 15.1 5.5 ula (0.71,0.29) 100,000 
QLED* 18.5 12 48 1.5 N/A 4,000 
PLED* 18 15 2.5 6.0 (0.64,0.35) N/A 
PLED*' 13 N/A N/A N/A (0.63,0.32) N/A 
S-OLED* 20.59 N/A N/A 6.0 (0.67,0.33) N/A 
Ph-OLED® 20 <10 <5.3 2.4 (0.63,0.37) > 1000,000 
Ph-OLED** 17.7 N/A N/A N/A (0.67,0.33) N/A 
TADF-OLED”:** 17.5 11 3.6 3.0 (0.61,0.39) < 40,000 


Our device is compared with the other high-performance red QLEDs, polymer LEDs (PLEDs), solution-processed small-molecule OLEDs (S-OLEDs) and vacuum-deposited OLEDs including both phosphorescence 
organic LEDs (Ph-OLEDs) and those using thermally activated delayed fluorescence emitters (TADF-OLEDs) in literature reports. Key parameters including peak EQE, EQE at 100 mAcm 2, wall-plug efficiency 


(wall-plug E;;) at 100 mA cm 2, turn-on voltage (Vin), CIE coordinates and Tso lifetime are listed. The parameters of our device are highlighted as blue. The parameters superior or inferior to those of our devices are 
highlighted as green or red, respectively. 


*This table does not include vacuum-deposited OLEDs with optical engineering (light coupling structures or horizontally aligned emitters) or tandem structures. 
+The wall-plug Ey is estimated from EQE and photon energy at the electroluminescence peak. 


{The Tso lifetime is converted from the values for different initial luminances to the values for an initial luminance of 100 cd m~? by assuming an acceleration factor of 1.5. 
§In ref. 7, the lifetime data are available only for an orange-emitting device. 
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Asymmetric photoredox transition-metal catalysis 


activated by visible light 


Haohua Huo!, Xiaodong Shen', Chuanyong Wang’, Lilu Zhang’, Philipp Rose', Liang-An Chen”, Klaus Harms', Michael Marsch’, 


Gerhard Hilt! & Eric Meggers!” 


Asymmetric catalysis is seen as one of the most economical strate- 
gies to satisfy the growing demand for enantiomerically pure small 
molecules in the fine chemical and pharmaceutical industries’. And 
visible light has been recognized as an environmentally friendly and 
sustainable form of energy for triggering chemical transformations 
and catalytic chemical processes”. For these reasons, visible-light- 
driven catalytic asymmetric chemistry is a subject of enormous current 
interest” >. Photoredox catalysis provides the opportunity to generate 
highly reactive radical ion intermediates with often unusual or uncon- 
ventional reactivities under surprisingly mild reaction conditions®. 
In such systems, photoactivated sensitizers initiate a single electron 
transfer from (or to) a closed-shell organic molecule to produce rad- 
ical cations or radical anions whose reactivities are then exploited 
for interesting or unusual chemical transformations. However, the 
high reactivity of photoexcited substrates, intermediate radical ions 
or radicals, and the low activation barriers for follow-up reactions 
provide significant hurdles for the development of efficient catalytic 
photochemical processes that work under stereochemical control and 
provide chiral molecules in an asymmetric fashion’. Here we report 
a highly efficient asymmetric catalyst that uses visible light for the 
necessary molecular activation, thereby combining asymmetric catal- 
ysis and photocatalysis. We show that a chiral iridium complex can 
serve as a sensitizer for photoredox catalysis and at the same time 
provide very effective asymmetric induction for the enantioselective 
alkylation of 2-acyl imidazoles. This new asymmetric photoredox 
catalyst, in which the metal centre simultaneously serves as the exclu- 
sive source of chirality, the catalytically active Lewis acid centre, and 
the photoredox centre, offers new opportunities for the ‘green’ syn- 
thesis of non-racemic chiral molecules. 

Recently, strategies have been developed in which efficient catalytic 
photochemical processes that work under stereochemical control and 
provide chiral molecules in an asymmetric fashion can be carried out by 
two catalysts that work in tandem for a single chemical transformation’. 
In such dual-catalyst reactions, visible-light redox sensitizers are com- 
bined with asymmetric co-catalysts, such as chiral secondary amines’, 
chiral N-heterocyclic carbenes™, chiral Bronsted acids’’, chiral Lewis 
acids”®, or chiral thiourea’’. With respect to single catalysts, ultraviolet 
light in combination with hydrogen bonding or Lewis acid interaction 
has been used previously in pioneering work to trigger enantioselective 
catalysis'**°, and an enantioselective cycloaddition induced by visible 
light—although not including photoinduced electron transfer—has been 
reported”; also, an interesting but special case of photoactivated enamine 
catalysis was disclosed recently in which a transient electron donor- 
acceptor complex is capable of absorbing visible light and triggering a 
charge transfer. General solutions for interfacing visible-light-induced 
photoredox chemistry and asymmetric catalysis with single catalysts 
are highly desirable, and will potentially provide new opportunities for 
reaction design by having a closer control over the entire reaction path, 
including the crucial stereodiscrimination step. 

Taking into account that currently used visible-light photosensitizers 
are typically based on transition-metal complexes**, and that chiral 


transition-metal complexes constitute an established class of catalysts 
for asymmetric transformations’, we envisioned the combination of these 
two features into a single transition-metal-based asymmetric photore- 
dox catalyst. We conducted our study with the recently developed chiral- 
at-metal iridium(1m) complex A-Ir1” and the derivative A-Ir2 (Fig. 1). 
In both complexes, the octahedral iridium centre is coordinated by two 
achiral bidentate ligands in a left (A)- or right (A)-handed propeller-type 
fashion, thereby establishing metal-centred chirality**”’, and coordinated 
by two additional labile acetonitriles which give access to a Lewis acid 
metal centre upon ligand exchange (see Supplementary Fig. 1 fora crystal 
structure of A-Ir2). Our laboratory has previously reported the activa- 
tion of «,B-unsaturated 2-acyl imidazoles by chiral-at-metal A- or A-Irl 
as a step towards the enantioselective addition of indole nucleophiles”, 
so we considered the possibility that these chiral Lewis acids might be 
capable of intertwining chiral enolate catalysis*® with photoredox rad- 
ical ion chemistry*~: we therefore selected the model reaction of 2-acyl 
imidazole 1a with the electron deficient benzyl bromide 2a as our start- 
ing point (see Table 1). Encouragingly, in the presence of light from a 
14 W energy-saving household lamp, A-Irl at a loading of 5 mol% was 
able to catalyse the reaction between 1a and 2a, providing the a-alkylation 
product 3a in good yield (85%) and with high enantioselectivity (95% 
enantiomeric excess, e.e.) after 20 h photolysis at room temperature (entry 
1 of Table 1). Optimization of the reaction conditions—by empirically 
adjusting the solvent, increasing the concentration to speed up the reac- 
tion, slightly raising the temperature to promote ligand exchange at the 
iridium centre, and adding the weak base Na,HPO, to facilitate enolate 
chemistry—provided the product 3a in an excellent yield of 97% with 
95% e.e. after exposure to visible light for just 3h in the presence of a 
reduced catalyst loading of just 2mol% A-Irl1 (entry 2). The catalyst 
A-Ir2 (2 mol%) even provided the a-alkylation product in quantitative 
yield with a superior enantioselectivity of 99% e.e. and a further reduced 
reaction time of 1.5h (entry 3). We attribute the improved enantios- 
electivity to an increased steric hindrance in A-Ir2 compared to A-Irl 
created by the long C-S bonds of the benzothiazole moieties, which 
position the two tert-butyl groups somewhat closer to the exchange- 
labile acetonitrile ligands (see Supplementary Fig. 2). The loading of the 
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Figure 1 | Chiral iridium complexes for asymmetric photoredox catalysis. 
S, substrate; I, intermediate; P*, non-racemic chiral product. The Ir centre acts 
as a chiral centre, a catalytic centre, and a photoredox centre. 
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Table 1 | Initial iridium-catalysed photoinduced enantioselective alkylation of acyl imidazole 1a with benzyl bromide 2a 


A-Irt or A-Ir2 e) CN 
ai oom visible diclidl?m Cc g 
\ 2 
ql N. Ph NO, 
(R)-3a 

Entry Catalyst Illumination* Reaction conditions t(h) Yieldt (%) e.e.8 (%) 
1 A-Ir1 (5 mol%) Visible light 1a (0.3 M, 3 equiv.), MeOH, RT 20 85 95 
2 A-Ir1 (2 mol%) Visible light NasHPO,, 1a (1.2 M, 3 equiv.), MeOH/THF (4:1), 40°C 3 97 95 
3 A-Ir2 (2 mol%) Visible light Same as above 1.5 100 99 
4 A-Ir2 (0.5 mol%) Visible light Same as above 4.5 o7 98 
5 A-Ir2 (2 mol%) Dark Same as above LS <5 ND 
6 None Visible light Same as above 16 0) NA 
RT, room temperature; ND, not determined; NA, not applicable. 
*Light source: 14 W white light energy-saving lamp. 
+All reactions performed under the exclusion of air. See Supplementary Methods for more details. 


t lsolated yields. 
§ Enantiomeric excess determined by HPLC analysis on chiral stationary phase. 


catalyst A-Ir2 can be further decreased to merely 0.5 mol% without 
much affecting the yield (97%) or the enantioselectivity (98% e.e.) (entry 4). 
We note that neither the catalyst A-Ir2 alone in the dark (entry 5) nor 
visible light in the absence of the catalyst (entry 6) trigger this reaction 
toa significant degree under these conditions, thus unequivocally dem- 
onstrating that it is the combination of iridium(1) complex and visible 
light that is necessary to efficiently catalyse the enantioselective C-C 
bond formation. 

Examples of the photoinduced enantioselective o-alkylation of 2-acyl 
imidazoles with benzyl bromides catalysed by A-Ir2 are summarized in 
Fig. 2. A variety of electron acceptor substituted benzyl bromides pro- 
vide the o-alkylation products in up to quantitative yields (97-100%) 
and with up to almost perfect enanantioselectivities (94-99% e.e.), while 
requiring only short reaction times of 1.5 to 6 h (3a-d). The 2-acyl-N- 
methylimidazole substrates tolerate steric (products 3e and 3f), electron 
donating (product 3g) and electron accepting (product 3h) substituents 
in the phenyl moiety, which can be replaced by the bicyclic aromatic 
naphthalene (product 3i) or the heteroaromatic thiophene (product 3)). 
Furthermore, the photoredox catalysed reaction also tolerates less acidic 
2-acyl-N-methylimidazoles devoid of any aromatic substituent at the 
methylene group, as demonstrated for the products 3k and 31. For these 
substrates, the addition of a weak base is essential to achieve high con- 
versions and excellent enantioselectivities. We also tested a different class 
of electrophiles, namely phenacyl bromides, and found that they readily 
provide the expected C-C bond formation products with very good yields 
of 86-91% and high enantioselectivities of 90-91% e.e. (products 3m-o). 
In order to reach satisfactory enantioselectivities, the N-methyl substi- 
tuent at the imidazole moiety needed to be replaced by the more bulky 
isopropyl group. Overall, it can be concluded that A-Ir2 is a highly 
effective catalyst for the o-alkylation of acyl imidazoles with acceptor 
substituted benzyl bromides and phenacyl bromides in the presence of 
visible light with high to quantitative yields and impressive enantios- 
electivities, while only using a catalyst loading of 2 mol%. 

A plausible mechanism in which photoredox catalysis intertwines 
with asymmetric catalysis is shown in Fig. 3. Herein, the catalysis is ini- 
tiated by the coordination of 2-acyl imidazoles (1) to the iridium catalyst 
in a bidentate fashion (intermediate I), followed by the formation of a 
nucleophilic iridium (m1) enolate complex (intermediate IT) upon depro- 
tonation. The subsequent chirality generating key step constitutes the 
exergonic addition of a photo-reductively generated electrophilic radi- 
cal to the electron rich metal-coordinated enolate double bond, thereby 
affording an iridium-coordinated ketyl radical (intermediate IT). Oxi- 
dation of this ketyl intermediate to a ketone by single electron transfer 
regenerates the iridium() photosensitizer and provides the iridium- 
coordinated product (complex IV), which is released upon exchange 
with unreacted starting material, followed by a new catalytic cycle. The 
proposed key intermediate which uniquely connects the asymmetric 
catalysis with the photoredox cycle is the iridium(1m) enolate complex II, 


which not only provides the crucial asymmetric induction in the catalysis 
cycle and but at the same time serves as the in situ generated active chiral 
photosensitizer’’. 


(@) 
1 
ss ak + Bra 
Noe 
1 


A-Ir2 (2 mol%) ie) 
visible light 


Na,HPO, (1.1 equiv.) Nae R 
40°C 


2 3a-o 
fe) CN f) NO, fe) CO,Me 
\ 5 \ 5 \ 5 
NO Ph NO, N Ph NO, NO Ph NO, 
(R)-3a (R)-3b (R)-3c 
1.5h, 100% yield, 99% e.e. 2h, 97% yield, 99% e.e. 2h, 98% yield, 99% e.e 
NO. 


Se 
CF 

Fal 

( } 5 
: fe) 
io 


NO, NO, \ 
(R)-3d 
6h, 97% yield, 94% e.e. R)-3e 


2h, 99% yield, 99% e.e. 


(R)-3f 
2h, 93% yield, 99% e.e. 


NO, 


0 3°" 


OMe Cl Ss 
(R)-3g (R)-3h (R)-3i 
2h, 97% yield, 97% ee. 2h, 98% yield, 96% e.e. 2h, 99% yield, 98% e.e. 
o NO, fe) CN fe) NO, 
N 
Is 
: Ny Ng 
ve Ger oe B \ = 
MS o) NO, N Me NO, Ch Et NO, 
Ss 


(R)-3j 
2 h, 99% yield, 97% e.e. 


i 


ph 9 


(S)-3k 
24 h, 87% yield, 97% e.e. 


yD ob 


oo 
2 h, 86% yield, 91% e.e. 


(S)-31 
36 h, 84% yield, 91% e.e. 


*y 


(R)-3m 
6h, 91% yield, 90% e.e. 


(R)-80 
24 h, 91% yield, 91% e.e. 


Figure 2 | Substrate scope of the photoinduced enantioselective alkylation 
of 2-acyl imidazoles with acceptor substituted benzyl bromides and 
phenacyl bromides. Top row, the studied reaction; all other rows show 
products, giving reaction time, isolated yields after chromatographic 
purification, and enantiomeric excess (e.e.), which was determined by HPLC 
on a chiral stationary phase. Product (S)-3k: for comparison, in the absence 
of base a yield of 18% with 91% e.e. was obtained after photolysis for 24h. 
Product (S)-3]: reaction was irradiated instead with a blue LED light source 
(3 W) in order to improve the yield. EWG, electron withdrawing group. 
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Figure 3 | Plausible mechanism for a combined photoredox and 
asymmetric catalysis. For variations of this mechanism, see Supplementary 
Fig. 7. SET, single electron transfer; EWG, electron withdrawing group; PS, 
photosensitizer in the form of enolate complex II. See main text for details. 


A series of investigations support this mechanism. To start with, as 
confirmed by X-ray crystallography (Supplementary Fig. 3), 2-acyl-N- 
methylimidazoles efficiently coordinate to the iridium catalyst A-Ir2 
in a bidentate fashion upon release of the two monodentate acetonitrile 
ligands, thereby providing the proposed intermediate complex I. Sub- 
sequent deprotonation generated the intermediate iridium(m) enolate 
complex II, which was independently isolated and unambiguously char- 
acterized by X-ray crystallography, as shown in Fig. 4a. This structure 
also illustrates that one face of the prochiral enolate n-bond is blocked 
bya tert-butyl group, thereby rationalizing the observed high asymmet- 
ric induction in the course of the proposed diastereoselective addition 
of the electron-deficient radical to the electron-rich double bond”. The 
determined absolute configurations of the «-alkylation products are con- 
sistent with this mechanistic picture. This reaction step relates to recent 
reports of the photosensitized generation of electron-deficient radicals 


CN 


can ol 4 (67 %yield) 
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and their stereoselective addition to electron-rich 1 systems of chiral 
enamines”’*. For our system, a radical mechanism is consistent with 
the observation that photoreactions in the presence of air, the alkene 
1,1-diphenylethylene (isolation of adduct 4) or the radical trap 2,2,6,6- 
tetramethylpiperidine-1-oxyl (TEMPO) (isolation of products 5 and 6) 
suppress the formation of the o-alkylation product (Fig. 4b). Further- 
more, the direct correlation between photolysis and product formation 
is demonstrated by a light-dark interval reaction shown in Fig. 4c. 
Importantly, several experimental results lead to the conclusion that 
the intermediate iridium(11) enolate complex IT is not only a key nucle- 
ophilic intermediate in the asymmetric catalysis cycle, but also consti- 
tutes the active in situ assembled photosensitizer in the photoredox 
cycle. Since iridium (1m) complex II apparently represents the only neutral 
iridium(m) complex within the reaction mixture, this conclusion is also 
consistent with observed trends regarding redox and photophysical prop- 
erties of iridium(1m) complexes—namely that neutral bis-cyclometalated 
iridium(u1) photosensitizers are significantly stronger photoreducing 
agents than their cationic counterparts*. Accordingly, Stern-Volmer 
plots (Fig. 4d) illustrate that the luminescence emission of the enolate 
complex II is quenched by benzyl bromide 2a much more efficiently 
compared to Ir2, which can be attributed to a fast electron transfer from 
a triplet excited state of enolate complex II to the electron-deficient benzyl 
bromide. This is furthermore supported by cyclic voltammetry (Sup- 
plementary Fig. 4), which reveals that the enolate complex IT has a sig- 
nificantly decreased oxidation potential (by around 1 V) compared to 
the cationic complex Ir2, and thus comprises a much stronger reducing 
agent in the ground state and even more so in its photoexcited state 
(Fig. 4e). The estimated excited state redox potential E, p(II /TI*) of 
—1.74V versus Ag/AgCl for the enolate complex II is comparable to 
that of fac-[Ir(ppy)3] (ppy = 2-phenylpyridine)’, an iridium sensitizer 
that has been used previously for the reductive cleavage of electron defi- 
cient benzyl bromides’°. Conveniently, compared to Ir2 and established 
iridium(m1) photosensitizers*, the enolate complex II displays a batho- 
chromically shifted long wavelength absorbance maximum with an 
additional shoulder at around 500 nm, thus permitting an excitation 


Figure 4 | Mechanistic investigations. a, X-ray 
crystal structure of the proposed Ir(i1) enolate 
complex intermediate II. This compound was 
crystallized as a racemic mixture, and only the 
A-enantiomer is shown here, as an ORTEP 
drawing with 30% probability ellipsoids. b, Control 
experiments in the presence of molecules which 
react with radicals. See main text for details. 

c, Light-dark interval experiment for the reaction 
la + 2a— 3a according to entry 3 of Table 1. 

d, Luminescence quenching experiments. Ip and I 
are respectively luminescence intensities in the 
absence and presence of the indicated 
concentrations of the electron deficient benzyl 
bromide 2a. e, Comparison of the photo and redox 
properties of catalyst A/A-Ir2 and the enolate 
complex II. Columns 2 and 3 show wavelength of 
maximum absorbance and emission, respectively. 
E°°, energy of the emitting excited state as 
calculated from the luminescence peak. Column 4 
shows the one-electron redox potential of the 
couple oxidized sensitizer / sensitizer as 
determined from the peak maximum of differential 
pulse voltammetry. Column 5 shows the one- 
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across half of the visible spectrum ranging from violet to green light 
(Fig. 4e and Supplementary Fig. 5). The visible-light absorbance of eno- 
late complex II is not affected by the presence of organic bromide sub- 
strates, thus most probably ruling out the possibility that an electron 
donor-acceptor complex between enolate complex IT and bromide sub- 
strate is responsible for the light absorption (Supplementary Fig. 6)*”. 
It is also worth noting that an independently synthesized enolate com- 
plex II is catalytically competent and catalyses the photoredox reaction 
with an identical efficiency compared to Ir2, thereby supporting the 
notion that complex II has a dual function as a chiral nucleophile in the 
catalytic cycle and the in situ photosensitizer in the photoredox cycle. 
Finally, two distinct variations of the outlined mechanism need to be 
considered (Supplementary Fig. 7). First, instead of regenerating the pho- 
tooxidized sensitizer in every cycle (ps* +e — PS, Fig. 3), the ketyl 
intermediate III might transfer a single electron directly to another bro- 
mide substrate, thus skipping the photoredox cycle and leading to a 
chain reaction. Although the direct light-dependence of the asymmetric 
photoactivated catalysis shown in Fig. 4c suggests that the asymmetric 
catalysis and photoredox cycle operate in concert at least to some extent, 
a contribution of the chain propagation mechanism as a function of the 
nature of the substrates is feasible, and might even explain the differences 
in photolysis times for the individual reactions”’. The second mechanis- 
tic variation to discuss revolves around the direct reaction of the inter- 
mediate benzyl radical with the oxidized sensitizer (PS*). However, a 
major contribution of this recombination process is unlikely since both 
reactive intermediates will not be generated in close proximity, consid- 
ering that the fragmentation of the formed radical anion does not occur 
instantaneously. In this respect, it has been established that the life time 
of such radical anions significantly increases in protic solvents and with 
a decreasing energy of the n* orbitals**. This notion is supported by an 
experiment in which low concentrations of the radical trap TEMPO were 
still able to capture the intermediate benzyl radical (see Supplementary 
Methods), thus rendering unlikely an efficient recombination of the 
benzyl radical with the oxidized iridium sensitizer, and instead favour- 
ing an addition of the intermediate electron-deficient benzyl radical with 
the electron-rich m-bond of the iridium(m) enolate complex I, which 
is present in solution at a much higher steady state concentration. 
We have reported a unique case of visible-light-induced asymmetric 
redox catalysis by a single, structurally simple catalyst. The two catalytic 
cycles are apparently connected through an intermediate iridium(1m) 
enolate complex, formed from the initial catalyst and the 2-acyl imida- 
zole substrate, which is not only the key nucleophilic intermediate in 
the asymmetric catalysis cycle but also constitutes the in situ generated 
active visible-light photosensitizer. The reaction scheme that we intro- 
duce here may serve as a blueprint for the design of other catalytic asym- 
metric photoredox reactions, and will most probably provide new avenues 
for the efficient and green synthesis of non-racemic chiral molecules. 


Received 4 August; accepted 23 September 2014. 


1. Walsh, P. J. & Kozlowski, M. C. Fundamentals of Asymmetric Catalysis (University 
Science Books, 2009). 

2. Zeitler, K. Photoredox catalysis with visible light. Angew. Chem. Int. Edn 48, 
9785-9789 (2009). 

3. Narayanam, J.M.R. & Stephenson, C. R. J. Visible light photoredox catalysis: 
applications in organic synthesis. Chem. Soc. Rev. 40, 102-113 (2011). 

4. Prier,C.K., Rankic, D.A.& MacMillan, D. W.C. Visible light photoredox catalysis with 
transition metal complexes: applications in organic synthesis. Chem. Rev. 113, 
5322-5363 (2013). 

5.  Schultz,D.M.& Yoon, T. P.Solarsynthesis: prospects in visible light photocatalysis. 
Science 343, 1239176 (2014). 

6. Schmittel, M. & Burghart, A. Understanding reactivity patterns of radical cations. 
Angew. Chem. Int. Edn Engl. 36, 2550-2589 (1997). 

7. Curran, D. P., Porter, N. A. & Giese, B. Stereochemistry of Radical Reactions: 
Concepts, Guidelines, and Synthetic Applications (VCH, 1996). 

8. Hopkinson, M. N., Sahoo, B., Li, J.-L. & Glorius, F. Dual catalysis sees the light: 
combining photoredox with organo-, acid, and transition-metal catalysis. Chem. 
Eur. J. 20, 3874-3886 (2014). 


LETTER 


9. Nicewicz, D. A. & MacMillan, D. W. C. Merging photoredox catalysis with 
organocatalysis: the direct asymmetric alkylation of aldehydes. Science 322, 
77-80 (2008). 

10. Shih, H.-W., Vander Wal, M. N., Grange, R. L. & MacMillan, D. W. C. Enantioselective 
a-benzylation of aldehydes via photoredox organocatalysis. J. Am. Chem. Soc. 132, 
13600-13603 (2010). 

11. Neumann, M.,Fildner, S., Konig, B. & Zeitler, K. Metal-free, cooperative asymmetric 
organophotoredox catalysis with visible light. Angew. Chem. Int. Edn 50, 951-954 
(2011). 

12. Cherevatskaya, M. et al. Visible-light-promoted stereoselective alkylation by 
combining heterogeneous photocatalysis with organocatalysis. Angew. Chem. Int. 
Edn 51, 4062-4066 (2012). 

13. Nagib, D.A., Scott, M. E. & MacMillan, D. W. C. Enantioselective o- 
trifluoromethylation of aldehydes via photoredox organocatalysis. J. Am. Chem. 
Soc. 131, 10875-10877 (2009). 

14. DiRocco, D. A. & Rovis, T. Catalytic asymmetric #-acylation of tertiary amines 
mediated by a dual catalysis mode: N-heterocyclic carbene and photoredox 
catalysis. J. Am. Chem. Soc. 134, 8094-8097 (2012). 

15. Tarantino, K.T., Liu, P.& Knowles, R. R. Catalytic ketyl-olefin cyclizations enabled by 
proton-coupled electron transfer. J. Am. Chem. Soc. 135, 10022-10025 (2013). 

16. Du, J., Skubi, K. L., Schultz, D. M. & Yoon, T. P. A dual-catalysis approach to 
enantioselective [2 + 2] photocycloadditions using visible light. Science 344, 
392-396 (2014). 

17. Bergonzini, G., Schindler, C. S., Wallentin, C.-J., Jacobsen, E. N. & Stephenson, 

C. R. J. Photoredox activation and anion binding catalysis in the dual catalytic 
enantioselective synthesis of B-amino esters. Chem. Sci. 5, 112-116 (2013). 

18. Bauer, A., Westkamper, F., Grimme, S. & Bach, T. Catalytic enantioselective 
reactions driven by photoinduced electron transfer. Nature 436, 1139-1140 
(2005). 

19. Miller, C., Bauer, A. & Bach, T. Light-driven enantioselective organocatalysis. 
Angew. Chem. Int. Edn 48, 6640-6642 (2009). 

20. Brimioulle, R. & Bach, T. Enantioselective Lewis acid catalysis of intramolecular 
enone [2+2] photocycloaddition reactions. Science 342, 840-843 (2013). 

21. Alonso, R. & Bach, T. A chiral thioxanthone as an organocatalyst for 
enantioselective [2+2] photocycloaddition reactions induced by visible light. 
Angew. Chem. Int. Edn 53, 4368-4371 (2014). 

22. Arceo, E., Jurberg, |. D., Alvarez-Fernandez, A. & Melchiorre, P. Photochemical 
activity of a key donor-acceptor complex can drive stereoselective catalytic 
a-alkylation of aldehydes. Nature Chem. 5, 750-756 (2013). 

23. Huo, H., Fu, C., Harms, K. & Meggers, E. Asymmetric catalysis with substitutionally 
labile yet stereochemically stable chiral-at-metal iridium(IIl) complex. J. Am. Chem. 
Soc. 136, 2990-2993 (2014). 

24. Fontecave, M., Hamelin, O. & Ménage, S. Chiral-at-metal complexes as asymmetric 
catalysts. Top. Organomet. Chem. 15, 271-288 (2005). 

25. Bauer, E. B. Chiral-at-metal complexes and their catalytic applications in organic 
synthesis. Chem. Soc. Rev. 41, 3153-3167 (2012). 

26. Evans, D.A., Downey, C. W. & Hubbs, J. L. Ni(II) bis(oxazoline)-catalyzed 
enantioselective syn aldol reactions of N-propionylthiazolidinethiones in the 
presence of silyl triflates. J. Am. Chem. Soc. 125, 8706-8707 (2003). 

27. Sato, H. & Yamagishi, A. Application of the AA isomerism of octahedral metal 
complexes as a chiral source in photochemistry. J. Photochem. Photobiol. C 8, 
67-84 (2007). 

28. Herrmann, A. T., Smith, L. L. & Zakarian, A. A simple method for asymmetric 
trifluoromethylation of N-acyl oxazolidinones via Ru-catalyzed radical addition to 
zirconium enolates. J. Am. Chem. Soc. 134, 6976-6979 (2012). 

29. Studer, A. & Curran, D. P. The electron is a catalyst. Nature Chem. 6, 765-773 
(2014). 

30. Andrieux, C. P.,Le Gorande, A. & Savéant, J. M. Electron transfer and bond breaking. 
Examples of passage from a sequential to a concerted mechanism in the 
electrochemical reductive cleavage of arylmethy| halides. J. Am. Chem. Soc. 114, 
6892-6904 (1992). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We acknowledge funding from the German Research Foundation 
(ME 1805/4-1). H.H. thanks the China Scholarship Council for a stipend. 


Author Contributions E.M. conceived and coordinated the project and wrote the Letter. 
E.M. and H.H. designed the experiments. H.H. carried out the majority of the 
experiments. X.S. synthesized the new catalyst A-Ir2. C.W. contributed to the synthesis 
of substrates. L.Z. contributed to the synthesis and crystallization of iridium complexes. 
L-A.C. provided insights into iridium enolate chemistry. P.R. performed and analysed 
the cyclic voltammetry under supervision of G.H. The X-ray crystallographic studies 
were performed by K.H. and M.M. 


Author Information The X-ray crystallographic coordinates for structures of the iridium 
complex A-Ir2, substrate coordinated iridium complex I and the iridium enolate 
complex II have been deposited at the Cambridge Crystallographic Data Centre 
(CCDC) under deposition numbers CCDC 1014509, 1014510 and 1014876, 
respectively. Reprints and permissions information is available at www.nature.com/ 
reprints. The authors declare no competing financial interests. Readers are welcome to 
comment on the online version of the paper. Correspondence and requests for 
materials should be addressed to E.M. (meggers@chemie.uni-marburg.de). 


6 NOVEMBER 2014 | VOL 515 | NATURE | 103 


©2014 Macmillan Publishers Limited. All rights reserved 


| sid Wal Be 


doi:10.1038/nature13857 


Recent Northern Hemisphere stratospheric HCl 
increase due to atmospheric circulation changes 


E. Mahieu’, M. P. Chipperfield’, J. Notholt*, T. Reddmann’, J. Anderson’, P. F. Bernath®”*, T. Blumenstock’, M. T. Coffey’, 
S. S. Dhomse?, W. Feng’, B. Franco!, L. Froidevaux!°, D. W. T. Griffith", J. W. Hannigan’, F. Hase*, R. Hossaini?, N. B. Jones", 
I. Morino’, I. Murata!’, H. Nakajima!, M. Palm?, C. Paton-Walsh", J. M. Russell III°, M. Schneider‘, C. Servais', D. Smale!* 


& K. A. Walker® 


The abundance of chlorine in the Earth’s atmosphere increased con- 
siderably during the 1970s to 1990s, following large emissions of an- 
thropogenic long-lived chlorine-containing source gases, notably the 
chlorofluorocarbons. The chemical inertness of chlorofluorocarbons 
allows their transport and mixing throughout the troposphere on 
a global scale’, before they reach the stratosphere where they release 
chlorine atoms that cause ozone depletion’. The large ozone loss over 
Antarctica*® was the key observation that stimulated the definition 
and signing in 1987 of the Montreal Protocol, an international treaty 
establishing a schedule to reduce the production of the major chlor- 
ine- and bromine-containing halocarbons. Owing to its implementa- 
tion, the near-surface total chlorine concentration showed a maximum 
in 1993, followed by a decrease of half a per cent to one per cent per 
year’, in line with expectations. Remote-sensing data have revealed a 
peak in stratospheric chlorine after 1996’, then a decrease of close to 
one per cent per year®’, in agreement with the surface observations 
of the chlorine source gases and model calculations’. Here we present 
ground-based and satellite data that show a recent and significant 
increase, at the 2a level, in hydrogen chloride (HCl), the main stra- 
tospheric chlorine reservoir, starting around 2007 in the lower stra- 
tosphere of the Northern Hemisphere, in contrast with the ongoing 
monotonic decrease of near-surface source gases. Using model simu- 
lations, we attribute this trend anomaly to a slowdown in the Northern 
Hemisphere atmospheric circulation, occurring over several con- 
secutive years, transporting more aged air to the lower stratosphere, 
and characterized by a larger relative conversion of source gases to 
HCL. This short-term dynamical variability will also affect other stra- 
tospheric tracers and needs to be accounted for when studying the 
evolution of the stratospheric ozone layer. 

Decomposition of chlorine-containing source gases in the stratosphere 
produces HCl, the largest reservoir of chlorine*’. Here we investigate 
recent trends in atmospheric HCl with observations from eight Network 
for the Detection of Atmospheric Composition Change (NDACC; http:// 
www.ndacc.org) ground-based stations located between 79° N and 45° S 
and operating Fourier Transform InfraRed (FTIR) instruments. Figure la 
shows the HC] total columns for Jungfraujoch (47° N; red squares) to- 
gether with the evolution of the total tropospheric chlorine (blue curve) 
over the past three decades. Figure 1b—d focuses on the recent HCl changes 
above Ny-Alesund (79° N) and two mid-latitude stations, Jungfraujoch 
(zoom of Fig. la) and Lauder (45° S). 

At the Southern Hemisphere station we find a continuous decrease 
of HCI since 2001, but both Northern Hemisphere sites show an overall 
HCI decline, more rapid around 2004, followed by an increase from 
2007 onwards. To quantify the column changes at all sites, we used a 


bootstrap resampling statistical tool”® involving a linear component and 
accounting for the strong seasonal modulations present in the data sets. 
Figure 2 displays, for the eight NDACC sites, the relative annual HCl 
rates of change for the 1997-2007 and 2007-2011 time periods, using 
either the 1997.0 or 2007.0 computed column as reference. For the 1997- 
2007 time interval, we determine consistent and significant HCl decreases 
at all Northern Hemisphere sites, with mean relative changes ranging 
from —0.7 to — 1.5 per cent per year. In the Southern Hemisphere, column 
changes are not significant at the 2¢ level. For 2007-2011, mean relative 
column growths of 1.1-3.4 per cent per year are derived for all Northern 
Hemisphere sites while negative or undefined rates are observed for 
Wollongong and Lauder in the Southern Hemisphere. 

To corroborate these findings with independent data, and to get infor- 
mation on the altitude range where these changes occur, we included the 
GOZCARDS" satellite data set (Global OZone Chemistry And Related 
Data sets for the Stratosphere version 1.01), which merges observations 
by the HALOE” (HALogen Occultation Experiment version 19), ACE- 
FTS* (Atmospheric Chemistry Experiment-Fourier Transform Spectrom- 
eter version 2.2) and Aura/MLS" (Microwave Limb Sounder version 
3.3) instruments. Partial columns were computed between 100 hPa and 
10 hPa, considering the zonal monthly mean mixing ratio time series 
available for the whole time interval in the 70°-80° N, 60°-70° N, 40°- 
50° N, 30°-40° N, 20°-30° N, 30°-40° S and 40°-50° S latitudinal bands. 
These partial columns typically span altitudes of 16-31 km, that is, the 
region with maximum HCl concentration and in which the FTIR mea- 
surements are most sensitive’. 

Corresponding rates of change are also displayed in Fig. 2. For 1997- 
2007, there is excellent agreement in the Northern Hemisphere between 
the satellite and the six NDACC-FTIR trends determined above. In the 
Southern Hemisphere, GOZCARDS reveals statistically significant de- 
creases of HC] at the 2c level, while the FTIR time series suggest stable 
columns at the same level of confidence. For 2007-2011, the ACE-FTS 
and Aura/MLS merged data confirm the upward FTIR trends in the 
Northern Hemisphere. Figure 3 illustrates this, showing satellite monthly 
means (red dots) for 30°-60° N and 30°-60° S, at 46 hPa and 7 hPa, 
together with a linear fit to the data for both time periods. The HCl in- 
crease is clearly confined to the Northern Hemisphere lower stratosphere. 

Because HCl is the main final product of the decomposition of any 
chlorine-containing source gases, we need to verify that its rise after 
2007 does not result from the substantial contribution of new unknown 
sources of chlorine whose emissions occur predominantly in the North- 
ern Hemisphere, not monitored by the in situ networks, and unregu- 
lated by the Montreal Protocol, its Amendments and Adjustments. Indeed, 
such chlorine-containing source gases have been recently identified’, 
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Figure 1 | Evolution of HCl in the Earth’s atmosphere. a, The long-term total 
column time series of HC] at Jungfraujoch (running average with a 3-yr 
integration length, step of 1 month; in red, left scale) and the global total 
tropospheric chlorine volume mixing ratio (blue curve, right scale, in parts per 
trillion, p.p.t). The lower panels display the running average total column time 


although in that case, their contribution to the HCl upturn can be ruled 
out by their very low concentrations. 

We have used results from two state-of-the-art three-dimensional 
chemical transport models, SLIMCAT’ and KASIMA’, to interpret the 
recent HCl increase. Both models performed a standard simulation 
using surface source gas mixing ratios from the WMO A1 (World Mete- 
orological Organisation; 2010) emission scenario* and were forced us- 
ing ERA-Interim meteorological fields’* from the European Centre for 
Medium-Range Weather Forecasts (ECMWE). The key results for HCl 
trends from both models agree. Here we show data from the SLIMCAT 
runs; corresponding results from KASIMA are shown in Extended Data 
Figs 1-4. To study the impact of atmospheric dynamics, an additional 
SLIMCAT run (S2000) used constant 2000 meteorological forcing, from 
2000 onwards. 

Running averages for both SLIMCAT simulations are reproduced in 
Fig. 1b-d. For the three sites, run $2000 (light green curve) predicts an 
overall HCl decrease while the standard run (green squares) reproduces 
the observed and distinct evolution prevailing in both hemispheres, after 
correction ofa constant low-bias of about 7% in the Northern Hemisphere 
simulations. The total column changes characterizing the model data 


series (1997-2011) of HCl at Ny-Alesund (b), Jungfraujoch (c) and Lauder (d), 
derived from the NDACC-FTIR observations, and the standard (green) and 
$2000 (light green) SLIMCAT simulations. The thin red lines correspond to 
the +2 standard error of the mean range. Minimum columns are observed in 
July 2007 at the Northern Hemisphere sites (dashed lines). 


sets are displayed in Fig. 2. The model runs predict significant de- 
creases in HCl for the 1997-2007 reference period at all sites and there 
is an overall agreement within the error bars for the amplitude of the 
signals between the model and the observations. Regarding the 2007- 
2011 time period, the SLIMCAT time series are characterized by positive 
trends from Ny-Alesund (79° N) to Tsukuba (36° N) and by significant 
decreases for the Southern Hemisphere stations, but show no change 
for the near-tropical site of Izana (28° N). The $2000 sensitivity run does 
not produce the HCl trend reversal and, instead, indicates declines at 
all sites. 

The agreement between measurement and model demonstrates that 
the HCl increase after 2007 is not caused by new, unidentified chlorine 
sources, or by underestimates in emissions of known species of chlorine- 
containing source gases, because these are used as model input. The 
agreement between model and observation also shows that there is a 
good understanding of the chemistry which converts source gases to HCl. 
The difference between the HCl trends forecast by the two SLIMCAT 
runs—that is, a significant increase for northern high- and mid-latitudes 
or a constant decrease below 30° N—establishes that changes in the 
atmospheric circulation cause the recent HCl increase, since only the 


a 1997-2007 b 2007-2011 Figure 2 | HCl relative rates of change for eight 
NDACC sites. a, The rates of change (per cent per 
79° N 4 a TE | Ny-Alesund —_year) for the 1997-2007 time period (1999-2007 
_— ean for Thule and Izana, 1998-2007 for Tsukuba). 
77, ‘SERB == cozc — hl L Thule b, As for a but for 2007-2011. The rates of change 
" mee SLIMCAT ; were derived from the FTIR and GOZCARDS 
68° N 4 = mms 2000 ae r Kiruna observational data sets and from the two SLIMCAT 
een , simulated time series (see colour key). The error 
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Figure 3 | Evolution of stratospheric HCl from 
satellite observations. Comparison of merged 
GOZCARDS satellite HCl observations (by 
HALOE, ACE-FTS and Aura/MLS) with 
SLIMCAT model runs for Northern Hemisphere 
and Southern Hemisphere mid-latitude lower 
(46 hPa) and upper (7 hPa) stratosphere. 
GOZCARDS monthly means are shown as red 
dots. Linear fits to the GOZCARDS data and 
standard SLIMCAT run are displayed as red and 
green lines, respectively, for periods before and 
after 2005. The dashed black line shows fits to the 


HCl volume mixing ratio (p.p.b.) 
° 


30°-60° N, 46 hPa $2000 run, which assumes no change in circulation. 
An upward trend is observed in the Northern 
Hemisphere lower stratosphere (d) while HCl is 
decreasing in the southern and northern upper 
stratosphere (a, b); volume mixing ratio in parts per 


billion (p.p.b.). 
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meteorological fields adopted from 2000 onwards differ between the 
two runs. To diagnose these circulation changes, we examined age-of- 
air maps produced by the standard SLIMCAT run. They reveal a slower 
circulation in the Northern Hemisphere lower stratosphere after 2005- 
2006, with older air characterized by a larger relative conversion of the 
chlorine-containing source gases into HCl. 

Figure 4b shows the age-of-air change between 2005-2006 and 2010- 
2011. Air older by up to 0.4 yr is found at altitudes of around 20-25 km 
in abroad range of Northern Hemisphere latitudes, in a region where the 
mean age-of-air is typically about 3 yr. There is an obvious correlation 
with the evolution of the HCl concentrations over the same time period 
(Fig. 4a), which exhibits a very similar pattern and hemispheric asym- 
metry. Time series of mean age-of-air near 50 hPa above Ny-Alesund, 
Jungfraujoch and Lauder are displayed in Fig. 4c. The 3-yr running 


A(HCI concentration) (1015 m-%) A(mean age) (yr) 


2006 2010 


means (black curves) indicate a progressive slowdown of the Northern 
Hemisphere stratospheric circulation after 2005-2006. For Lauder, a 
fairly constant circulation speedup occurs from 2000 onwards. 

These changes are significant at the 2a level, with Northern Hemisphere 
air ageing by 3-4 weeks per year after 2005, compared to about 1 week 
per year before. For Lauder, the mean age-of-air change during the last 
decade is calculated to be —2 weeks per year. Other important factors, 
such as the details of specific transport pathways, which lead to a given 
mean age-of-air, also affect the conversion rate of the source gases to 
HCI (ref. 17). These pathways are simulated by the model but not re- 
vealed by the simple diagnostic of mean age-of-air. The slower Northern 
Hemisphere circulation occurring over a few years after 2005-2006 
seems to contrast with the speedup of the Brewer—-Dobson circulation, 
which is predicted in the very long-term to be a response to climate 


Figure 4 | Spatial distribution of the HCl 
concentration and age-of-air changes. Mean 
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change'*”, but the recent slowdown is probably part of dynamical 
variability occurring on shorter timescales: it does not imply a change 
in the general circulation strength. More than year-to-year variability, 
it is multiyear periods of age-of-air increase or decrease, such as those 
highlighted in our study or reported recently”, that will probably com- 
plicate the search of a long-term trend in mean circulation. 

We have presented observations and simulations of a recent HCl 
increase in the Northern Hemisphere lower stratosphere. We ascribe it 
to dynamical variability, occurring on a timescale of a few years, char- 
acterized by a persistent slowing of stratospheric circulation after 2005, 
bringing HCl-enriched air into the Northern Hemisphere lower stra- 
tosphere. We find no evidence that unidentified chlorine-containing 
source gases are responsible for this HCl increase. In the Southern 
Hemisphere, a fairly constant decrease has been observed over the past 
ten years. Globally, our ground-based observations indicate a mean HCl 
decrease of 0.5 per cent per year for 1997-2011, compatible with the 
0.5-1 per cent per year range that characterized the post-peak reduc- 
tion of tropospheric chlorine’. Hence, we conclude that the Montreal 
Protocol is still on track, and is leading to an overall reduction of the 
stratospheric chlorine loading. However, multiyear variability in the 
stratospheric circulation and dynamics, as identified here, could lead 
to further unpredictable increases or redistribution of HCl and other 
stratospheric tracers. Therefore, such variability and its causes will have 
to be thoroughly characterized and carefully accounted for when evalu- 
ating trends or searching for ozone recovery. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


The ground-based observations were performed at the NDACC sites by solar absorp- 
tion spectrometry in the infrared spectral region, using FTIR high-resolution instru- 
ments. Observations are recorded under clear sky conditions year-round, except at 
Ny-Alesund and Thule, where the polar night prevents measurements between 
about October and February. The HC] total columns were retrieved with the SFIT- 
2, SFIT-4 or PROFFIT algorithm in narrow spectral ranges encompassing isolated 
lines of HCI”, generally assuming pressure-temperature profiles provided by the 
National Centers for Environmental Prediction (NCEP). The GOZCARDS"' data 
set for HCl includes zonal average monthly mean time series of stratospheric mixing 
ratio profiles merging individual measurements from the HALOE (1991-2005), 
ACE-FTS (2004 onward) and Aura MLS (2004 onward) satellite-borne instruments. 
Line parameters from recent HITRAN databases”! were adopted in the spectrometric 


analyses. We used the SLIMCAT and KASIMA models’ to support our investi- 
gations. Both used ERA-Interim analyses provided by ECMWF'"*, and they pro- 
vided consistent results for the HC] trends, giving confidence in their robustness. 
The models contain detailed treatments of stratospheric chemistry and have been 
extensively used for studies of stratospheric ozone’. Stratospheric age-of-air was 
diagnosed in the model runs using an idealized tracer with a linearly increasing 
tropospheric mixing ratio. For the $2000 SLIMCAT simulation, 6-hourly winds of 
2000 were used every year from 2000 onwards. The trend determinations were per- 
formed with a bootstrap resampling statistical tool’®, considering all available daily 
or monthly means (excluding the winter months for the very high-latitude sites) 
while the model data sets were limited to days with available FTIR measurements. 
We studied the impact of the FTIR sampling using the bootstrap algorithm, and 
found no statistically significant impact on the calculated trends. 
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Extended Data Figure 1 | Evolution of HCl in the Earth’s atmosphere and at Ny-Alesund (b), Jungfraujoch (c) and Lauder (d), derived from the 
comparison with KASIMA model results. a, The long-term total column time | NDACC-FTIR observations and from the KASIMA run (grey). The thin red 
series of HC] at Jungfraujoch (running average with a 3-yr integration length, _ lines correspond to the +2 standard error of the mean range. The vertical 
step of 1 month; in red, left scale, in molecules per cm”) and the global total dashed lines identify the occurrence of the minimum total columns at the 
tropospheric chlorine volume mixing ratio (blue curve, right scale). Lower Northern Hemisphere sites, in July 2007. 

panels display the running average total column time series (1997-2011) of HCl 
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Extended Data Figure 2 | HCl relative rates of change at eight NDACC sites. _ periods, respectively. They were derived from the FTIR and GOZCARDS 
aandb provide the rates of change (per cent per year) for the 1997-2007 (1999- _ observational data sets and from the SLIMCAT and KASIMA simulated time 
2007 for Thule and Izana, 1998-2007 for Tsukuba) and 2007-2011 time series (see colour key). The error bars correspond to the 2¢ level of uncertainty. 
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Extended Data Figure 3 | Evolution of stratospheric HCl from satellite 
observations. Comparison of merged GOZCARDS satellite HCl observations 
(by HALOE, ACE-FTS and Aura/MLS) with KASIMA model results for 
Northern and Southern Hemisphere mid-latitude lower (46 hPa) and upper 
(7 hPa) stratosphere. GOZCARDS monthly mean observations are shown as 


1998 
Year 
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2006 2010 


red dots. Linear fits to the GOZCARDS data and the KASIMA run are 
displayed as red and blue lines, respectively, for periods before and after 2005. 
An upward trend is observed and modelled in the Northern Hemisphere lower 
stratosphere (d) while HCl is decreasing in the southern and northern 

upper stratosphere (a, b); volume mixing ratio in parts per billion. 
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Extended Data Figure 4 | Spatial distribution of the HCl concentration and 
age-of-air changes. Mean differences of the HCl concentration (a) and age-of- 
air (b) between 2010/11 and 2005/06, as a function of altitude and latitude, 
derived from the KASIMA model simulation. c, Running averages of the mean 
age-of-air at 50 hPa (thick/thin curve, integration length of 36/6 months), at the 


same sites as in Fig. 1 (time series at 79° N/45° S have been shifted vertically by 
—0.75/—0.50 yr). Comparison with age-of-air time series derived from 
SLIMCAT (see Fig. 4c) indicates that KASIMA provides higher absolute values 
of mean age-of-air. Note that the upper boundary of KASIMA is at 120 km, 
yielding higher mean ages, compared to SLIMCAT (upper boundary 60 km). 
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Selection for niche differentiation in plant 
communities increases biodiversity effects 


Debra Zuppinger-Dingley', Bernhard Schmid", Jana S. Petermann?”*, Varuna Yadav’, Gerlinde B. De Deyn* & Dan F. B. Flynn"® 


In experimental plant communities, relationships between bio- 
diversity and ecosystem functioning have been found to strengthen 
over time’”, a fact often attributed to increased resource complemen- 
tarity between species in mixtures’ and negative plant-soil feedbacks 
in monocultures*. Here we show that selection for niche differenti- 
ation between species can drive this increasing biodiversity effect. 
Growing 12 grassland species in test monocultures and mixtures, we 
found character displacement between species and increased bio- 
diversity effects when plants had been selected over 8 years in species 
mixtures rather than in monocultures. When grown in mixtures, 
relative differences in height and specific leaf area between plant species 
selected in mixtures (mixture types) were greater than between species 
selected in monocultures (monoculture types). Furthermore, net bio- 
diversity and complementarity effects’? were greater in mixtures of 
mixture types than in mixtures of monoculture types. Our study dem- 
onstrates a novel mechanism for the increase in biodiversity effects: 
selection for increased niche differentiation through character dis- 
placement. Selection in diverse mixtures may therefore increase species 
coexistence and ecosystem functioning in natural communities and 
may also allow increased mixture yields in agriculture or forestry. How- 
ever, loss of biodiversity and prolonged selection of crops in monocul- 
ture may compromise this potential for selection in the longer term. 

Higher biodiversity promotes stability and productivity, with an in- 
creasing effect over time’”. These positive biodiversity effects on stability 
and productivity can arise from complementarity between species in 
resource use, such as partitioning of soil resources™® or, in some cases, 
accumulation of greater resources at high diversity’. Concurrently, the 
accumulation of natural enemies in low diversity, the Janzen—Connell 
effect*”, both promotes species coexistence through density-dependent 
mortality and limits productivity owing to the high pressure of species- 
specific pathogens*. Increasing biodiversity effects could therefore arise 
from increasingly complementary resource use and nutrient accumula- 
tion in mixtures or pathogen accumulation over time in monocultures. 
Here we propose a distinct, novel mechanism—that increased biodiversity 
effects over time result from selection for increased niche differentiation”® 
between plant species in diverse plant communities, reducing competi- 
tion between species. Character displacement”, as reflected in func- 
tional trait differences, may drive such increasing niche differentiation 
between species’. Our hypothesis predicts larger functional trait differ- 
ences between species in mixtures and, associated with such divergence 
in traits, stronger positive biodiversity effects on productivity. Selection 
for increased niche differentiation could thus explain the experiment- 
ally observed'” increasing biodiversity effects over time, and may have 
implications for the effect of biodiversity on agricultural and forestry 
production, in which genetic diversity is known to promote production” 
and biodiversity may be essential in maintaining the pace of produc- 
tion gains”. 

According to our hypothesis, mixture communities composed of 
the progeny of plants grown in conditions of high diversity (‘mixture 
types’) should have greater complementarity than mixture communities 


composed of the progeny of plants grown in monocultures (‘mono- 
culture types’). We hypothesize that heritable phenotypes resulting from 
distinct genotypic or epigenetic features, including maternal effects, can 
arise from selection pressures in diverse communities. Our hypothesis of 
selection for niche differentiation through character displacement pre- 
dicts adaptation to local diversity, with mixture types selected for niche 
differentiation and high performance in mixtures. We tested our hypo- 
thesis by growing mixture and monoculture types of 12 plant species— 
collected from selection communities of monocultures, mixtures of four 
or more species ofa single functional group, and mixtures of four or more 
species of four functional groups (grasses, small herbs, tall herbs and 
legumes)—in pots containing four individuals of one or two species in 
a glasshouse (Extended Data Fig. 1). Our selection communities (selec- 
tion history) were experimental plots of an 8-year biodiversity field ex- 
periment in Jena, Germany”, and our test communities (planted diversity) 
were the pots containing four individuals each (Fig. 1). We assessed above- 
ground plant biomass in mixtures versus monocultures to calculate bio- 
diversity effects, partitioned into complementarity and sampling effects’ 
(which are usually called selection effects’, a phrase we will not use here 
to avoid confusion). We consider higher net biodiversity and complemen- 
tarity effects for mixture types as an indication of niche differentiation, 
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Figure 1 | Experimental design. Plant material, shoots and roots (n = 4,900) 
from experimental field monoculture versus mixture selection communities 
established in Jena, Germany in 2002, were collected in 2010 and assembled in 
new experimental glasshouse monoculture versus mixture test communities in 
2010/11 (n = 855). We expected that mixture test communities would have 
higher productivity if assembled from plants collected from mixture selection 
communities in the field and vice versa for monocultures. Different shades of 
green represent the hypothesized selection for monoculture or mixture types 
from 2002 to 2010. 
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Figure 2 | Stronger biodiversity effects for plants selected in mixtures 
compared with plants selected in monocultures. a, b, Plants selected in 
mixture plots in the Jena Experiment over 8 years showed stronger biodiversity 
effects than plants selected in monoculture plots over the same time period. 
Mix, mixture types; Mono, monoculture types. a, Biodiversity effects were 
assessed by additive partitioning’ of net effects into complementarity and 
sampling effects (n = 545). b, The plots are ordered by functional group 
combinations: grasses (g); small herbs (sh); tall herbs (th); legumes (1). Symbols 
are means +1 standard error of the mean (s.e.m.) calculated from 

raw data. 


and also tested whether such niche differentiation was greater for multi- 
functional group mixture types than for monofunctional group mixture 
types. Furthermore, we tested for character displacement by measuring 
relative differences in functional traits that reflect plant growth strategies’®, 
height and specific leaf area (SLA) between species in mixtures (abso- 
lute difference between two species divided by the mean of the two). 
Mixtures of mixture types had higher biomass than mixtures of mono- 
culture types (Extended Data Table 1, P = 0.024), and this pattern was 
consistent across functional group combinations. Net biodiversity effects 
and complementarity effects were larger for mixture types than for mono- 
culture types (Fig. 2a, b, P = 0.017 and P = 0.005, respectively; Table 1), 
indicating increased niche differentiation, with consistent results across 
the majority of functional group combinations, as illustrated in Fig. 2b. 
The positive effect of mixture types on net effects was strongest in grass 
mixtures, short-herb mixtures and mixtures of grasses with short herbs. 
Strong complementarity effects in response to selection history were 
found in these same mixtures, as well as in mixtures of legumes with 
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grasses or tall herbs. Positive sampling effects in response to selection 
history were found for grass mixtures, legume mixtures and mixtures of 
legumes with short herbs, but in general the response of sampling effects 
to selection history was slightly negative (Fig. 2a). 

The stronger biodiversity effects obtained with mixture types in contrast 
to monoculture types were mirrored by larger functional trait differ- 
ences between species in mixtures of mixture types rather than mono- 
culture types. This was the case for relative height differences between 
species (Fig. 3a; Extended Data Table 2, P = 0.011) and for relative dif- 
ferences in SLA (Fig. 3b, P< 0.001). Mixture test communities with le- 
gumes showed particularly large SLA differences between mixture types 
(Extended Data Table 2, P< 0.001). Functional diversity, calculated from 
height, SLA and additionally reproductive biomass, was greater in test 
communities of mixture types than of monoculture types (Fig. 3c and 
Extended Data Table 2). We found a trend of increased intraspecific 
functional diversity when monocultures were planted with monocul- 
ture types as compared with mixture types (mean difference = —0.202, 
paired t-test, P= 0.101), indicating a broader spread of phenotypes 
within the monoculture type populations. 

The association between increased biodiversity effects and increased 
functional trait differences of mixture types was reflected in marginally 
significant correlations across selection history treatments: complemen- 
tarity effects rose with greater relative differences in SLA (P = 0.073) and 
selection effects rose with greater relative differences in height (P = 0.074). 
We expect the functional traits measured here to be representative of 
relevant niche dimensions. However, niche differentiation is probably 
multivariate, and additional traits such as rooting depth could be in- 
cluded in future experiments. Furthermore, such experiments should 
assess potential changes of traits during the course of an experiment; 
here we only measured them at the end. 

We demonstrate increased mixture performance and biodiversity 
effects in test communities of mixture types and provide evidence that 
these increases were driven by increased niche differentiation due to 
character displacement of functional traits between species. Our results 
were obtained with 12 typical grassland species of four different func- 
tional groups, supporting our hypothesis that increased biodiversity 
effects can result from selection for increased mixture performance and 
suggesting that these results may apply more generally. In a field-based 
extension of the current work, we found increased biodiversity effects 
across a wider range of species (n = 52) and species richness levels (1, 2, 
4, 8) for communities sharing a common selection history (Extended Data 
Fig. 2, interaction of selection history and species richness, P< 0.001). 
Whereas mixtures may select for increased complementarity and charac- 
ter displacement between species, it is conceivable that selection pres- 
sures in monocultures select for greater defence against species-specific 
pathogens known to accumulate in monocultures®. We observed increas- 
ed monoculture performance and reduced biodiversity effects in mix- 
tures planted with monoculture types. Because mixture and monoculture 
types experienced selection environments for only 8 years, the standing 
variation at the beginning of the experiment may have already included 
genotypes or epigenetic variants pre-adapted for monoculture or mixture 
environments’”"*. 


Table 1 | Net effect, complementarity effect and sampling effect on community biomass 


NE CE SE 
Source of variation numDF denDF F P denDF F P denDF F P 
Selection history 
Monoculture versus mixture 1 62.9 6.04 0.017 644 8.49 0.005 66.7 0.01 0.921 
Functional group combination 9 61.4 24 0.021 61.1 1.05 0410 64.7 1.91 0.067 
Monoculture versus mixture x functional group combination 9 66.2 143 0.193 69.0 1.32 0.242 70.5 0.90 0.527 
Random terms n ie; s.e. ie Se. vc se. 
Monoculture versus mixture X species combination 86 0.942 0.415 0.123 0.086 0.117 0.056 
Residual 545 8.314 0.547 2.254 0.148 1.245 0.082 


Results of mixed-effects analysis of variance (ANOVA) for net effects (NE; untransformed), complementarity effects (CE; square-root transformed) and sampling effects (SE; square-root transformed). denDF, 
degrees of freedom of error term (which can be fractional in residual maximum likelihood analysis); numDF, degrees of freedom of term. F, variance ratio; n, number of replicates for random effects; P, error 


probability; s.e., standard error of variance component; VC, variance component. 
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Figure 3 | Plants selected in mixtures show character displacement between 
species when grown in mixture. a-c, After 8 years of selection in the Jena 
Experiment, mixture types (Mix) in comparison with monoculture types 
(Mono) showed character displacement between species. a, b, Relative 
differences between the two species in mixture for plant height (n = 219 
aggregated differences) (a) and SLA (n = 208 aggregated differences) (b). 

c, Functional diversity*® (calculated from height, SLA and reproductive 
biomass) of the two species in mixture (unit-less functional diversity index, 

n = 219 aggregated values). Symbols are means +1 s.e.m. calculated from raw 
data. Means are averages over all 50 species combinations. 


Niche differences between species decrease the strength of interspe- 
cific competition relative to intraspecific competition’. Thus, selection 
in high-diversity communities with high interspecific competition can be 
expected to favour genotypes with more distinct niches, reducing niche 
overlap and competition between species. Reduced interspecific compe- 
tition could also result from the extension of the total community niche” 
in addition to, or instead of, finer division of currently used resources. 
The notion of diversity as a driver of plant population differentiation has 
been suggested in theory”' and a field study has demonstrated that dif- 
ferential selection for monoculture and mixture types in grassland spe- 
cies can occur’, supporting the idea that local evolutionary changes arise 
from selection through competition” over short time scales™. In addition, 
species diversity has been shown to influence plant traits associated with 
light and resource uptake such as shoot, leaf and stem length”*”®, with 
increasing functional diversity within communities”. In our study, mixture 
types across all functional groups showed greater relative differences in 
height and SLA between species in mixtures. These between-species dif- 
ferences may have resulted from directional selection or selection for 
increased plasticity in mixture types. Finally, infrared spectral finger- 
prints obtained for 8 of the 12 species showed significant differences in 
metabolic profiles between monoculture- and mixture-type indivi- 
duals (Extended Data Fig. 3), reflecting differential chemistry. 

We demonstrated an interaction between selection community and 
test community diversity, with increased community performance of 
mixture types. Furthermore, mixture types exhibited interspecific trait 
divergence, which potentially explains the increased biodiversity effects 
for mixtures of mixture types in our study. The consequences of selec- 
tion in long-term field experiments have previously been considered in 
theory”, and selection has been shown to drive community dynamics 
in microcosms”; our results demonstrate that such evolutionary pro- 
cesses can cause the emergence of stronger biodiversity effects over time 
in plant biodiversity experiments. Capturing the potential of this pro- 
duction-enhancing niche differentiation in diverse communities may 
have profound impacts for agricultural and forestry applications, for 
example by achieving increased productivity in crop mixtures using 
varieties that have been selected in diverse planting regimes. This novel 
mechanism also implies that species losses can affect not only ecosystem 
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functioning in the short term, but also the long-term trajectory of biolo- 
gical communities. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Experimental setup. To test whether plant types selected over 8 years in mixtures 
outperform those types selected in monocultures when assembled in mixture test 
communities, and vice versa for types selected in monocultures, we first selected 12 
of 16 species grown in large monoculture and mixture plots in the Jena Experiment, 
Germany (50°55’ N, 11°35’ E, 130 m above sea level) (http://www.the-jena-experiment. 
de; see ref. 14 for experimental details). Altogether there are 60 species in the Jena 
Experiment, but 44 of them do not occur in large monoculture plots. Three species 
from each of the following four functional groups of plant species used in the Jena 
Experiment were selected (a fourth species in each functional group was omitted 
because of poor growth in monoculture): grasses (Festuca pratensis, Festuca rubra, 
Poa pratensis), small herbs (Plantago lanceolata, Prunella vulgaris, Veronica chamae- 
drys), tall herbs (Crepis biennis, Galium mollugo, Geranium pratense) and legumes 
(Lathyrus pratensis, Onobrychis viciifolia, Trifolium repens) (Extended Data Fig. 1). 
These species had an 8-year community history growing in monocultures or mix- 
tures consisting of plants belonging to a single functional group or to all four func- 
tional groups. In April 2010, we collected 4,900 plant cuttings from 48 of the 82 plots 
in the Jena Experiment. 

We used these plant cuttings to establish plots in an experimental garden in slug 
exclosure compartments at the University of Zurich, Switzerland (47°23' N, 8°33’ E, 
534 m above sea level) with an identical plant composition to the plots in Jena from 
which the cuttings were collected. We added a layer of potting soil (BF 4, De Baat; 
Extended Data Table 3) to the soil in each plot to make sure the plants established. 
Netting around each plot minimized the possibility of cross-pollination between the 
same species from different community histories. The cuttings were used for the prop- 
agation of further cuttings or to produce seeds for our study. 

Using 25 of the plant cuttings of each species from the three field community 
histories, we generated further cuttings, from April 2010 until September 2010, in 
pots in the experimental glasshouse to set up the first block of the experiment. 
Finally, the cuttings used in the experiment were the result of six rounds of pro- 
pagation. This was done to reduce potential carry-over effects. Each set of cuttings 
was timed according to the slowest growing species, every 3 to 4 weeks. 

During the summer of 2010, we collected seed material from the experimental 
garden plots for the second block of this experiment. The seed material was dried in 
a glasshouse compartment. We cleaned the seeds from the husks/pods and stored 
them at 10-15 °C, 50% humidity in a climate chamber. Once all the seeds were cleaned, 
they were treated with cold stratification at 5 °C for 2 months. Seeds were germi- 
nated in a 10.5-h day regime with 14-19 °C day and 10-16 °C night temperature. 

In November 2010 we transplanted randomly selected individuals that were cut- 
tings from the 25 original cuttings. We planted monocultures of four plants or two- 
species mixtures of two plus two plants into pots (4,275 cm’) filled with neutral 
agricultural soil (Extended Data Table 3) according to a diallel design containing all 
possible combinations of species within and among functional groups according to 
available plant material (Extended Data Fig. 1). Cuttings of the legume Onobrychis 
viciifolia were not successfully propagated and were therefore excluded from the 
first block. In October 2011, we transplanted seedlings into pots following the same 
procedure and design. In total we planted 12 monoculture and 50 two-species com- 
binations as test communities with plants of three types of selection history: mono- 
culture types and mixture types taken from mixtures of a single functional group 
(monofunctional group mixture types) and from mixtures of four different plant 
functional groups (multifunctional group mixture types), replicated, if possible, three 
times for cuttings and three times for seedlings (n = 855 pots; Extended Data Fig. 1). 
Single pots always contained four plants of a single selection history. Plant traits and 
biomass were measured 20 weeks after planting for the block established with cut- 
tings and the block established with seedlings. To exclude effects of plant-soil feed- 
backs, we used a neutral growth substrate (50% agricultural sugarbeet soil, 25% sand, 
25% Perlite; Ricoter AG; Extended Data Table 3) throughout the experiment. 

Once the plants were transplanted into the pots, glasshouse conditions were set 
to natural summer day length and day temperatures of 20 °C and night temperatures 


of 17 °C. To supplement sunlight, additional light was provided at a maximum of 
30 kLux (Metallhalogenlamps 400 W, Iwasaki MT 400 DL/BH). Shading was at 
20 kLux. To compensate for overheating, an adiabatic cooling system (Airwatech) 
was used. The plants were watered in the trays to make sure that each individual 
received equal water volume. Seedlings that died in the first 2 weeks were replaced 
with seedlings of the same age. Pot locations were randomized in the glasshouse 
without reference to history or species combination. 

Measurements and harvest. Height and leaf number were measured at planting to 
ensure that the cuttings and seedlings were standardized. After 20 weeks of growth 
in the pots, plant height was measured again and the aboveground biomass of each 
individual was harvested at ground level. The inflorescence, if present at harvest, was 
collected separately and the dry biomass weighed as an indication of reproductive 
effort. SLA of representative leaves of each species in a pot was measured by scanning 
fresh leaves (Licor LI-3100) immediately after harvest and determining the mass of 
the same leaves after drying. Research assistants assisted in the regular measure- 
ments and harvesting of plants at the end of the experiment, and these assistants were 
not informed of the specific experimental treatments. 

Statistical analysis. We compared the performance of plants in monoculture versus 
mixture test communities after being selected in either monoculture or mixture 
communities for 8 years using the mean aboveground dry biomass of our test com- 
munities as the response variable. We used general mixed models using residual 
maximum likelihood (REML) and summarized results in ANOVA tables. Significance 
tests were based on approximate F-tests using appropriate error terms and denom- 
inator degrees of freedom. The fixed terms in the models were: block (cuttings versus 
seedlings); planted diversity (monoculture versus mixture); planted functional group 
diversity (monoculture versus monofunctional group mixture versus multifunctional 
group mixture); selection history (monoculture selection versus mixture selection, 
resulting in monoculture versus mixture types); functional group diversity of the 
selection history (monoculture selection versus monofunctional group selection versus 
multifunctional group selection); presence or absence of legumes in the test commu- 
nity; functional group combination within a test community; and interactions among 
these. Glasshouse table, pot within table, and species combination were used as 
random terms. No outliers were excluded from this analysis; the full model is pre- 
sented in Extended Data Table 1. 

To assess biodiversity effects we followed the additive partitioning method pre- 
viously described"’, and partitioned the net effect (NE) into complementarity effects 
(CE) and sampling effects (SE). Calculations were based on the difference between 
the observed yield of each species in the mixture and mean monoculture yield for 
that species in the corresponding block and for that specific selection history. 
Absolute values of CE and SE were square-root transformed and the original signs 
put back on the transformed values for analysis'*. There were 545 pots with 
mixture test communities for which NE, CE and SE could be calculated. For the 
analysis of the biodiversity effects we could simplify the full mixed model described 
earlier to a concise model presented in Table 1. 

Relative differences in height and SLA (absolute difference between two species 
divided by the mean of the two), as well as functional diversity (FD), were calculated 
between species in mixtures. Differences in these measures between mixtures of mono- 
culture versus mixture types might be associated with the differences in biodiversity 
effects between mixtures of monoculture versus mixture type; this would identify 
character displacement as a potential mechanism underlying positive biodiversity 
effects. FD was calculated following ref. 30 using height, SLA and reproductive biomass 
as functional traits. Relative differences and FD values between species means in 
mixture test communities were aggregated at the level of species combination by 
selection history by block (n = 219), using means calculated for each species by 
selection history by block combination. Statistical analysis was done with the full 
model described earlier and is presented in Extended Data Table 2. 

Statistical analyses were conducted using the software products R, version 2.15.3 
(R Development Core Team), and GenStat, version 16 (VSN International). 
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Extended Data Figure 1 | Designed number of pots planted for each 
combination of species. Plants from three different selection histories 
(monoculture, plot containing one species; monofunctional group, plot 
containing at least four species of the same functional group of plants; mixed- 
functional group, plot containing at least four species of four different 
functional groups) were grown in three different types of test communities 
(monoculture, monofunctional mixture, mixed-functional mixture). Twelve 
species in the Jena Experiment were chosen from all four functional groups: 
grass (g) (Festuca pratensis, Festuca rubra, Poa pratensis), small herb (sh) 
(Plantago lanceolata, Prunella vulgaris, Veronica chamaedrys), tall herb (th) 
(Crepis biennis, Galium mollugo, Geranium pratense), legume (1) (Lathyrus 
pratensis, Onobrychis viciifolia, Trifolium repens); numbers after the letter 
abbreviations refer to the different species. This design was used once with 
plants raised from cuttings (Block 1) and once with plants raised from seedlings 


(Block 2). Overall we aimed to obtain the same 12 monocultures and 48 two- 
species combinations as test communities for each block. Availability of species 
precluded some of the two-species combinations in each block, such that they 
had to be replaced by other combinations. This yielded a total of 50 
combinations across the two blocks, with several that were unique within a 
block. Each monoculture and each two-species combination was assembled 
three times for each of the three types of selection histories in each block. Some 
monocultures and some two-species combinations could not be realized with 
all types of selection histories in both blocks. Overall, there were 855 pots, 168 
monocultures and 687 two-species mixtures; for 545 of the latter, the net 
biodiversity effect could be partitioned into complementarity and sampling 
effects. Some missing monocultures precluded the calculation of biodiversity 
effects in certain mixtures. 
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Extended Data Figure 2 | Biodiversity-productivity relationship is stronger 
for plants with a common selection history. Aboveground net primary 
productivity of communities in an experimental manipulation of plant species 
richness and selection history (common history versus no common history). 
In this experiment, species represented an expanded set from the present 
experiment (52 species), and were planted within a large-scale field experiment 
in Jena, Germany, on mixed soil from 48 plots from which plants had been 
selected, thus equalizing potential effects of soil legacy among treatments. 
Plants without selection history were grown from seed from a seed company, 


while plants with selection history were seed progeny from plots of exactly the 
same species composition as the one in which they were replanted (same 
propagation procedure as for the 12 species used in the test communities of the 
present study). The slope of the biodiversity-productivity relationship was 
steeper for plants with a common selection history (significance of slope 
differences tested with interaction term log(species richness) X selection 
history in mixed model with random-effects factor for 48 specific plant 
communities; P< 0.001, n = 96). 
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Extended Data Figure 3 | Selection for different biochemical features in 
monocultures and mixtures. Ordinations (non-metric multidimensional 
scaling (NMDS)) of second derivative of spectral wavenumbers of 8 of the 12 
species used in the present study, showing effects of 8-year selection history on 
plant individuals derived from monoculture and mixture communities (Jena 


Experiment). This can be an indication of selection for different biochemical 
features over 8 years in monoculture and mixtures. Stress values reflect a 
measure of goodness of fit for NMDS, with lower values showing better 
representation of the original data. 
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rps ana Data Table 1 | Results of mixed-effects ANOVA for the aboveground biomass of test communities 20 weeks after transplanting 
plants into pots 


Source of variation numDf denDf F P 
Seedlings versus cuttings 1 65.2 5.89 0.018 
Selection history: monoculture versus mixture 1 717.4 0.55 0.457 
Selection history: monofunctional group versus multi-functional group mixture 1 707.1 0.11 0.735 
Planted diversity: monoculture versus mixture 1 41.2 4.84 0.034 
Planted diversity: monofunctional group versus multi-functional group mixture 1 41.9 0.09 0.767 
Selection history: monoculture versus mixture x planted diversity: monoculture versus mixture 1 714.3 5.15 0.024 
Selection history: monofunctional group versus multi-functional group mixture x planted diversity: 

monofunctional group versus multi-functional group mixture 3 713.4 1.30 0.274 
Legumes 1 44.5 38.02 <0.001 
Functional group combination 7 43.8 2.51 0.029 
Selection history: monoculture versus mixture x legumes 1 727.7 4.04 0.045 
Planted diversity: monoculture versus mixture x legumes 1 46.3 0.14 0.712 
Selection history: monoculture versus mixture x functional group combination 7 729.2 1.17 0.318 
Planted diversity: monoculture versus mixture x functional group combination 2 42.4 0.41 0.664 
Selection history: monoculture versus mixture x planted diversity: monoculture versus mixture 

x legumes 1 704.0 1.27 0.260 
Selection history: monoculture versus mixture x planted diversity: monoculture versus mixture 

x functional group combination 2 731.1 1.58 0.207 
Selection history : monofunctional group versus multi-functional group mixture x planted diversity: 

monofunctional group versus multi-functional group mixture x legumes 3 725.8 0.45 0.719 
Selection history : monofunctional group versus multi-functional group mixture x planted diversity: 

monofunctional group versus multi-functional group mixture x functional group combination 8 719.6 2.23 0.023 
Block x selection history: monoculture versus mixture 1 729.9 2.9 0.089 
Block x planted diversity: monoculture versus mixture 1 50.5 0.00 0.986 
Block x selection history: monoculture versus mixture 

x planted diversity: monoculture versus mixture 1 722.1 2.08 0.149 
Block x selection history: monofunctional group versus multi-functional group mixture 

x planted diversity: monofunctional group versus multi-functional group mixture 5 361.9 0.68 0.642 
Random terms n vc s.e. 

Block x glasshouse table 44 0.662 0.235 

Species combination 62 0.772 0.764 

Block x species combination 109 2.905 0.822 

Residual 855 6.708 0.367 


denDF, degrees of freedom of error term (which can be fractional in residual maximum likelihood analysis); numDF, degrees of freedom of term. F, variance ratio; n, number of replicates for random effects; P, error 
probability; s.e., standard error of variance component; VC: variance component. 
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Extended Data Table 2 | Results of mixed-effects ANOVA for relative height difference, relative SLA difference and for functional diversity 


Height difference SLA difference Functional diversity 


Source of variation numDf denDf F P numDf denDf F P  numDf denDf F P 


Seedlings versus cuttings 1 36.1 0.10 0.757 1 28.6 26.21 <0.001 1 33.7 3.07 0.089 


Selection history: monoculture versus mixture 1 30.7 7.25 0.011 1 27.2 20.62 <0.001 1 32.3 17.42 <0.001 
Planted diversity: monofunctional group versus 

multi-functional group mixture - - - - 1 38.0 1.24 0.272 1 37.8 4.23 0.047 
Functional group combinations with legumes 1 37.1 8.20 0.007 1 38.7 0.28 0.602 1 39.2 3.69 0.062 
Functional group combinations rest 8 36.8 0.90 0.525 7 38.5 3.25 0.008 7 38.2 4.64 <0.001 
Seedlings versus cuttings x selection history: 

monoculture versus mixture 1 83.4 37.24 <0.001 1 82.3 91.76 <0.001 1 85.3 36.41 <0.001 


Seedlings versus cuttings x planted diversity: 
monofunctional group versus multi-functional group 
mixture - - - - 1 27.7 434 0.047 1 33.3 2.84 0.101 


Seedlings versus cuttings x functional group 
combinations with legumes 1 36.4 0.43 0.517 - - - - 1 34.6 5.97 0.02 


Seedlings versus cuttings x functional group 
combinations rest 8 36.1 0.55 0.809 - - - - - - - - 


Seedlings versus cuttings x functional group 
combinations all - - - - 8 29.2 1.20 0.334 - - - - 


Selection history: monoculture versus mixture x 
planted diversity: monofunctional group versus 


multi-functional group mixture - - - - - - - - 1 32.3 1.37 0.251 
Selection history: monoculture versus mixture x 
functional group combinations with legumes 1 31.1 2.11 0.156 1 27.6 18.13 <0.001 1 32.7 13.00 0.001 
Selection history: monoculture versus mixture x 
functional group combinations rest 8 30.7 2.48 0.034 8 27.2 0.60 0.773 - - - - 


Seedlings versus cuttings x selection history: 

monoculture versus mixture x planted diversity: 

monofunctional group versus multi-functional group 

mixture - - - - - - - - 1 88 12.08 <0.001 


Seedlings versus cuttings x selection history: 
monoculture versus mixture x functional group 
combinations all - - - - 9 83.7 8.96 <0.001_ - - - - 


Seedlings versus cuttings x selection history: 
monoculture versus mixture x functional group 
combinations with legumes 1 84.9 20.11 <0.001_ - - - - 1 85.8 5.41 0.022 


Seedlings versus cuttings x selection history: 
monoculture versus mixture x functional group 


combinations rest 8 84.6 3.22 0.003 - - - - 21 92.3 2.51 0.001 
Random terms n VC s.e. n VC s.e. n VC s.e. 
Species combination 50 0.006 0.031 50 0.040 0.015 50 4139 4449 
Selection history: monoculture versus mixture x 

species combination 93 0.005 0.002 89 0.007 0.003 93 6784 2010 
Seedlings versus cuttings x species combination 88 0.160 0.041 85 0.025 0.008 88 14483 4023 
Residual 219 0.002 0.003 208 0.005 0.001 219 2152 350 


denDF, degrees of freedom of error term (which can be fractional in residual maximum likelihood analysis); numDF, degrees of freedom of term. F, variance ratio; n, number of replicates for random effects; P, error 
probability; s.e., standard error of variance component; VC: variance component. 
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Extended Data Table 3 | Composition of the experimental substrate 


Parameter Unit GVZ AGR 
Carbon* ug/g 411.5 34.60 
Hydrogen* ug/g 47.7 4.6 
Nitrogen* ug/g 8.0 2.1 
pHt 5.4 7.9 
Organic matter’ %(mass) 55 3.5 
Clay" %(mass) 1.0 1.0 
silt" %(mass) 1.0 1.0 
Nitrate" mg/l 439 730 
Ammonium" mg/l 0.7 2.5 
Phosphorus* mg/l 20 0.3 
Potassium’ mg/l 54 127 
Calcium? mg/l 119 187 
Magnesium mg/| 43 40 


Composition of 1 g of substrate GVZ Tref GO PP 7000 (BF4: black peat; white peat; clay; mineral fertilizer, 1.3 kg m~°) and neutral agricultural soil (50% sugarbeet soil, sieved; 25% washed river sand, 0-2 mm; 
25% perlite, 2-6 mm; AGR; RicoterAG). All units in mg! lare per litre extract solution. 

* Composition determined using elemental analysis. 

+ Composition determined by Ibu (Laboratory for Soil Analysis, Thun, Switzerland), program 40 analysis. 
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Nodal signalling determines biradial asymmetry 


in Hydra 
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In bilaterians, three orthogonal body axes define the animal form, 
with distinct anterior—posterior, dorsal-ventral and left-right asym- 
metries. The key signalling factors are Wnt family proteins for the 
anterior—posterior axis, Bmp family proteins for the dorsal-ventral 
axis and Nodal for the left-right axis’. Cnidarians, the sister group 
to bilaterians, are characterized by one oral-aboral body axis, which 
exhibits a distinct biradiality of unknown molecular nature. Here we 
analysed the biradial growth pattern in the radially symmetrical cni- 
darian polyp Hydra, and we report evidence of Nodal in a pre-bilaterian 
clade. We identified a Nodal-related gene (Ndr) in Hydra magnipa- 
pillata, and this gene is essential for setting up an axial asymmetry 
along the main body axis. This asymmetry defines a lateral signal- 
ling centre, inducing a new body axis of a budding polyp orthogonal 
to the mother polyp’s axis. Ndris expressed exclusively in the lateral 
bud anlage and induces Pitx, which encodes an evolutionarily con- 
served transcription factor that functions downstream of Nodal. 
Reminiscent of its function in vertebrates”’, Nodal acts downstream 
of B-Catenin signalling. Our data support an evolutionary scenario 
in which a ‘core-signalling cassette’ consisting of B-Catenin, Nodal 
and Pitx pre-dated the cnidarian-bilaterian split. We presume that 
this cassette was co-opted for various modes of axial patterning: for 
example, for lateral branching in cnidarians and left-right pattern- 
ing in bilaterians. 

An animal body plan can be described by using the axes of a Car- 
tesian coordinate system™*. Bilaterian animals, for example, exhibit one 
major body axis, passing from anterior to posterior, a horizontal plane 
of symmetry separating the dorsal and ventral sides and a mid-sagittal 
plane of symmetry separating the left and right sides. How these body 
axes and bilaterality arose in ancestral animals, which have a simpler 
body plan, is unknown’. An answer to this question might be found in 
the sister group to the bilaterian clade, cnidarians: this diverse clade is 
more than 600 million years old and contains jellyfish (Scyphozoa, 
Staurozoa and in Hydrozoa), corals (in Octocorallia and in Hexacorallia), 
the freshwater polyp Hydra (in Hydrozoa) and less familiar forms such 
as sea anemones (in Hexacorallia) and hydroids (in Hydrozoa)° (Fig. 1a). 
The essence of the simple, gastrula-shaped cnidarian body plan is radi- 
ality with a major oral-aboral body axis*®. Perfect radial symmetry is, 
however, uncommon, since most cnidarians exhibit morphological fea- 
tures of distinct biradiality either in their internal organization (antho- 
zoans) and/or in colonial branching patterns (anthozoans and hydrozoans) 
(Fig. 1b). Even in the freshwater polyp Hydra, whose morphology shows 
perfect radial symmetry, evidence for biradiality has been reported dur- 
ing the formation of buds, which arise in a biradial pattern’ (Fig. 1c), 
similar to branching colonial hydrozoans (Fig. 1b). 

To identify the molecular factors causing biradial asymmetry in cni- 
darians, we systematically searched in cnidarian genome databases for 
members of the Tgf-B superfamily known to be involved in defining the 
dorsal-ventral or left-right axis in bilaterians. Based on molecular phy- 
logenetic analyses and experimental results, sequences were grouped 
into four categories: Activin and Tgf-B, Bmp2/4, Bmp5-8 and Nodal 


families (where cnidarian Bmp2/4, for example, is the common ances- 
tor of bilaterian Bmp2 and Bmp4) (Extended Data Figs 1-3 and Sup- 
plementary Table 1). Of particular interest was a Nodal-related gene 
(hereafter denoted Ndr), which tends to cluster with bilaterian Nodal 
(Extended Data Fig. 2). Our phylogenomic analysis revealed that an- 
tagonists of Nodal signalling (that is, members of the Cerberus and Dan 
family), the signal transduction molecule Smad2/3, and the downstream 
target transcription factors Pitx, FoxA and Lhx1/5 are encoded in cni- 
darian genomes (Extended Data Figs 1 and 4-6 and Supplementary 
Table 1). This finding suggests that in the last ancestor of Cnidaria and 
Bilateria, some of these components of the Nodal-signalling pathway 
had already been specifically deployed. Among the Tgf-f family mem- 
bers encoded by Hydra, only H. magnipapillata Ndr exhibited a clear 
asymmetrical expression pattern in the presumptive budding zone and 
in early buds, and when the buds developed further, its expression dis- 
appeared (Fig. 1d). By comparison, the Hydra genes orthologous to 
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Figure 1 | Branching morphology in cnidarians and expression of Nodal- 
signalling genes in Hydra. a, A simplified metazoan tree with the major 
cnidarian clades exhibiting a branching (orange text) or non-branching (black 
text) morphology. The images show Obelia dichotoma (Hydrozoa) (top), a 
Telesto sp. (Octocorallia) (centre) and Melithaea flabellifera (Octocorallia) 
(bottom). (Images courtesy of H. Namikawa (top and bottom) and Y. Imahara 
(centre).) b, A colony of an athecate hydrozoan shows a branching pattern 
with consecutive polyp generations (A, B and C) that remain attached (from 
ref. 25). c, The biradial pattern in Hydra bud formation’ is related to the colonial 
branching pattern (b). A mother polyp (A; blue) forms continuous buds 
(B1-B3; yellow) that arise perpendicularly to the oral-aboral axis in the 
budding zone and detach. The buds are induced directly opposite each other, 
resulting in an alternating pattern of lateral branching. C1 (pink) indicates 
the position of the third bud generation. (Unpublished image courtesy of 

B. Hobmayer.) d, e, The biradial expression patterns of H. magnipapillata Ndr 
(d; blue) and Pitx (e; blue) in the bud anlage of budless and budding Hydra 
polyps with emerging buds (B1-B3). Scale bar, 2mm. 
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Figure 2 | Requirement for Nodal signalling for lateral bud initiation. 

a, Tissue of the budding zone expressing green fluorescent protein (GFP) was 
treated with purified recombinant H. magnipapillata Ndr protein and 
transplanted into the identical region of untreated wild-type Hydra; BSA was 
used as a control. b, Quantification of the effect of Ndr treatment on bud 
formation. c, Enhancement of bud-forming capacity by Ndr was confirmed by 
overexpressing Ndr after electroporation of cells with Ndr-expressing plasmid. 
d, Experimental scheme showing Alk4/5/7 inhibitor treatment and siRNA- 
mediated Ndr or Pitx knockdown. Note that Ndr and Pitx, but not ActL1, are 
expressed (blue) at stage 1. The lines in orange and blue show Ndr inhibition 
starting at bud stages 1 and 3, respectively. e, Treatment with the inhibitors 
A-83-01 (50 nM), SB-505124 (10 1M), LY-364947 (100M) or SD-208 

(2.5 UM) inhibits bud induction when treatment started at bud stage 1. DMSO, 
dimethylsulphoxide. f, A-83-01 inhibited bud induction (B1 and B2 buds). 

g, A-83-01 had no effect on budding when treatment started after bud 

stage 3. h, i, Knockdown of Ndr (h) or Pitx (i) using siRNA at bud stage 1 
suppressed bud induction. The kinetics of bud inhibition in siRNA-transfected 
animals are shown. b, c, e, f-i, The data are presented as mean + s.e.m. 

*, P<0.05 compared with control (BSA, DMSO or GFP siRNA); 

one-sided f-test. 


Bmp2/4, Bmp5-8 (ref. 8) and Activin-related genes were expressed ubiq- 
uitously or mainly at the oral side of polyps and did not show any sign 
of lateral asymmetry within the budding zone (Extended Data Fig. 7). 
In bilaterians, Nodal and its target transcription factor Pitx2 constitute 
a co-expressed signalling unit”!°. The H. magnipapillata Pitx gene was 
expressed unilaterally in the earliest bud stage, similarly to Ndr (Fig. le 
and Extended Data Fig. 7). The other Nodal target genes in H. magni- 
papillata, FoxA and Lhx1/5 (ref. 11), were also expressed in the devel- 
oping bud but not in the presumptive budding zone (Extended Data 
Fig. 7). 
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Figure 3 | Autoregulatory feedback loop of Nodal signalling and 
asymmetrical patterning of the budding zone. a, Reversible suppression of H. 
magnipapillata Ndr, Pitx and Bral expression (blue) by A-83-01 treatment. 
Ndr, Pitx and Bral expression is suppressed by A-83-01 but strongly 
upregulated 24h after inhibitor removal. The numbers on the images denote 
the number of Ndr-, Pitx- or Bral-expressing polyps/the total number of 
polyps. b, Expanded bud formation after inhibitor removal. Note the 
perturbation of the biradial asymmetry of the budding zone after A-83-01 
removal, with an expansion of Pitx expression (blue) and multiple signalling 
centres expressing the head organizer genes Wnt3a and Bral (both blue) in a 
spot-like pattern. The numbers denote the number of biradial, expanded or 
radial polyps/the total number of polyps. 


To investigate the function of H. magnipapillata Ndr signalling, we 
examined the effect of recombinant Ndr on bud formation. When tissue 
explants of the budding zone from budless animals were treated with 
purified recombinant Ndr protein and transplanted into the corres- 
ponding region of untreated hosts, the rate of budding was markedly 
higher than for BSA-treated control tissue (Fig. 2a, b). We also observed 
an activation of bud formation in animals in which Ndr was overex- 
pressed after electroporation of an Ndr-expressing plasmid (Fig. 2c). 
While Ndr-transfected cells were observed throughout the body axis, 
the overexpression increased the budding rate only from the budding 
zone and did not induce ectopic bud formation (Extended Data Fig. 8). 
This finding suggests that only the budding zone contains cells com- 
petent for Ndr activity and/or that additional signalling pathways are 
required for bud induction. 

The requirement for Ndr for bud formation was also examined by 
using chemical inhibitors of Activin-receptor-like kinase 4 (Alk4)/5/7. 
Nodal, Activin and Tgf-B share Alk4/5/7 as receptors, and their activ- 
ities can be blocked through specific interference by administering A- 
83-01, SB-505124, LY-364974 and SD-208 (ref. 12). While none of the 
Hydra Activin-related genes was expressed before the bud became vis- 
ible at stage 3 (Fig. 2d and Extended Data Fig. 7b), H. magnipapillata 
Nar and Pitx were exclusively expressed in the bud anlage at stage 1-2 
(Fig. 2d). When we treated polyps with Alk4/5/7 inhibitors from bud 
stage 1-2 onwards (Fig. 2d, orange line), the bud formation was sig- 
nificantly inhibited, with the strongest effect with A-83-01 administra- 
tion (Fig. 2e, fand Extended Data Fig. 9a—d). By comparison, inhibitor 
treatment from stage 3 onwards yielded normal bud formation (Fig. 2g). 
The crucial function of Ndr and Pitx in budding was confirmed by short 
interfering RNA (siRNA)-mediated gene knockdown experiments 
(Fig. 2h, i, and Extended Data Fig. 10). These data show that the func- 
tion of Ndr and Pitx is essential for bud initiation. 

In animals that were treated with the inhibitor A-83-01, we found 
complete downregulation of the expression of H. magnipapillata Ndr, 
Pitx and Brachyuryl (Bra1), an early head marker, at the bud anlage 
(Fig. 3a). This indicates that Nodal signalling is required for the asym- 
metrical expression of Pitx and Bral and suggests an autoregulatory 
control of Ndr expression. Removal of the inhibitor resulted in an un- 
expectedly strong response. More than half of the treated polyps did 
not regain their normal, biradial, budding pattern. Instead, they showed 
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expanded expression domains of Ndr, Pitx and Bral around the bud- 
ding zone (Fig. 3a). The resultant bud tissue exhibited expanded and/or 
multiple expression domains of Wnt3a, Bral and Pitx (Fig. 3b), as well 
as an increase in the number of head-specific tentacles (Supplementary 
Table 2). These data not only show that Ndr, Pitx and Bral are down- 
stream targets of Nodal signalling in Hydra but also that the localized 
activation of Ndr signalling at one side of the ring-like budding zone is 
crucial for inducing a bud anlage. The expanded expression of Ndr and 
its target genes suggests that, in addition to the autoregulatory feedback 
loop of Nodal signalling, an inhibitory factor is active. The expression 
of this putative inhibitor restricts Nodal activity to the bud anlage. This 
hypothesis is reminiscent of the Nodal-Lefty feedback interaction, which 
belongs to the best-known activator-inhibitor systems in vertebrates*"*. 
Lefty has so far been found only in deuterostomes (Extended Data Figs 1 
and 2). Despite the existence of various Cerberus-related sequences in 
Hydra and other cnidarians (Extended Data Figs 1 and 4a), whole mount 
in situ hybridization (WISH) data exclude the involvement of these 
genes in the asymmetrical induction of the bud anlage (Extended Data 
Fig. 7b). Thus, the molecular nature of Nodal antagonism in Hydra re- 
mains to be identified. 

Next, we addressed the question of regulatory genes that act up- 
stream of Nodal signalling. One candidate is B-Catenin, which is upreg- 
ulated in a ring-like expression domain that defines the budding zone’. 
It has been postulated that within this ring, a single peak of activation 
is created by lateral inhibition, which induces the evagination of the 
bud". Figure 4a shows that H. magnipapillata Ndr exhibits a similar 
expression pattern to /-Catenin in each prospective budding zone but 
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Figure 4 | B-Catenin signalling is essential for H. magnipapillata Ndr 
expression and bud formation. a, Co-expression of H. magnipapillata 
B-Catenin, Ndr and Pitx in the presumptive budding zone and early bud 

(B1 and B2 buds, stage 1). At bud stage 1, the expression of $-Catenin and Ndr 
become restricted towards the bud anlage, resulting in localized Pitx expression. 
b, Increased expression of Ndr, Pitx and Bral in polyps treated with the 
GSK3 inhibitor ALP for 24h. c, Left, Inhibition of Ndr and Pitx expression 
(both blue) by treatment with the B-Catenin inhibitor iCRT14 or the Alk4/5/7 
inhibitor A-83-01. The numbers on the images denote the number of positive 
animals/the total number of animals. Right, Double WISH staining of Ndr 
(blue) and ('-Catenin (red) or Pitx (red) expression after treatment with iCRT14 
or A-83-01. The numbers denote the number of Ndr-positive polyps/the 
number of $-Catenin-positive polyps or the number of Pitx-positive polyps/the 
number of Ndr-positive polyps. d, e, Dose-dependent inhibition of bud 
initiation (d) and bud development (e) by iCRT14 treatment at stages 1 and 3, 
respectively. b, d, e, The data are presented as mean + s.e.m.; *, P< 0.05 
compared with control (0 1M ALP or DMSO); one-sided t-test, (for further 
details, see Methods). 
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then becomes restricted to the side of the emerging bud (stage 1), where 
Pitx becomes transcriptionally activated. This finding suggests that Ndr 
and Pitx act downstream of §-Catenin signalling in initiating the site 
of bud evagination. We tested this hypothesis by ectopically activating 
Wnt--Catenin signalling with the GSK3 inhibitor alsterpaullone (ALP)”*. 
In ALP-treated polyps, Ndr, Pitx and Bral expression were upregulated 
in a dose-dependent manner (Fig. 4b). Inhibition of B-Catenin by treat- 
ing animals with the B-Catenin inhibitor iCRT14 blocked the express- 
ion of these target genes in the budding region (Fig. 4c). Double WISH 
analyses demonstrated that the /-Catenin, Ndr and Pitx have an over- 
lapping expression pattern at the bud anlage (Fig. 4c). Inhibitors of B- 
Catenin (iCRT14) and Nodal (A-83-01) signalling abrogated Ndr and 
Pitx expression in the budding zone (Fig. 4c). Quantitative PCR (qPCR) 
data show that this inhibitory effect is concentration dependent for Pitx 
(Extended Data Fig. 9e). This finding was confirmed by the concentration- 
dependent inhibition of bud formation by treating animals with iCRT14 
at bud stage 1 (Fig. 4d). Inhibition of B-Catenin signalling at bud stages 3 
and later also interfered with bud development (Fig. 4e). This finding 
substantiates previous reports suggesting a function of §-Catenin in the 
Hydra organizer'*"*. It also indicates that Nodal signalling functions 
downstream of Wnt-B-Catenin signalling only during bud initiation to 
induce the head organizer in an asymmetrical and biradial pattern. 
In summary, our data show that $-Catenin and Nodal-Pitx signal- 
ling act together in a core-signalling cassette that has the potential to 
break symmetry in a morphogenetic field characterized by high B- 
Catenin activity (Fig. 5). Downstream of Wnt-f-Catenin signalling, the 
Nodal-Pitx pathway induces biradial budding asymmetry of Hydra. 
The same pathway acts in left-right axis formation in vertebrates and 
snails’*’”-”, in which the symmetry-breaking event is upstream of 
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Figure 5 | Nodal signalling and secondary body axes of eumetazoans. The 
primary Wnt-f-Catenin axis (blue) was established by a primary symmetry 
break in the last common metazoan ancestor and led to the anterior (aboral, 
Ab)-posterior (oral, O) axis pattern. In the common ancestor of cnidarians and 
bilaterians, the core-signalling cassette comprising B-Catenin—-Nodal-Pitx- 
signalling components evolved and induced a secondary symmetry break and 
further body axes (orange). In the branching cnidarian Hydra, the signalling 
cassette was employed to specify an orthogonal axis inducing lateral budding. 
In modern bilaterians, this signalling cassette was deployed to specify the 
left-right (L-R) axis. This signalling cassette also has a crucial role, in a 
concerted action on Bmp signalling, for dorsal-ventral (D-V) axis patterning, 
by inducing Bmp antagonists at the dorsal organizer in vertebrates. A, anterior; 
P, posterior. 
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Nodal expression and a loss of Nodal signalling causes randomization 
of asymmetry*”®. Likewise, we found a randomization of asymmetry 
when Nodal was overexpressed after inhibitor removal and thus over- 
rode the given asymmetry cues (Fig. 3). Our results also provide new 
insight into the evolution of animal body plans. So far, Nodal signal- 
ling has not been considered to have a prominent function in the early 
evolution of metazoan body axes. We propose that the f-Catenin—Nodal- 
Pitx core-signalling cassette evolved in early metazoans before the 
cnidarian-bilaterian split (Fig. 5). Accordingly, the last common ances- 
tor of cnidarians and bilaterians exhibited a major oral—aboral axis pat- 
terned by Wnt-B-Catenin signalling”’ and was able to break radial 
symmetry by using Nodal signalling. Nodal signalling might then have 
been differentially co-opted in animal evolution. In Hydra and other 
cnidarians, Nodal signalling causes biradial branching and colony for- 
mation. In bilaterians, Nodal signalling has a basic role in establishing 
left-right symmetry (see above), is involved in the primary axial pat- 
terning of sea urchins and amphioxus'”'*” and acts in the formation 
of the organizer in chordates”**. Further studies are needed to identify 
how Nodal has contributed to the evolutionarily more complex morpho- 
genetic program for the organizer and for axis formation in bilaterians. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Molecular phylogenetic analysis. Sequences were aligned using MAFFT multiple 
sequence alignment and were checked manually. Phylogenetic trees were recon- 
structed using BioNJ and maximum likelihood as implemented in IQPNNI v3.3 
(ref. 26) (using 600 iterations and the WAG model”) and also using PhyML v3.0 
or v3.1 (ref. 28) (using the LG model’’). Bootstrap support values were obtained from 
100 bootstrap samples (if not stated otherwise) with the same parameters (using the 
bootstrap option -bs in IQPNNI v3.3) and were summarized using a relative major- 
ity consensus as implemented in TREE-PUZZLE v5.3 (ref. 30). In all cases, rate het- 
erogeneity was modelled using four discrete gamma rate categories. The NCBI 
accession numbers for all genes annotated in this study are AB823860 to AB824011. 
C. pacificum sequences are accessible at http://polyp.biochem.uci.edu/blast/. 
Culture of Hydra strains. Experiments were carried out with H. magnipapillata (strain 
105) or Hydra vulgaris (strain AEP), which were cultured as described previously”’. 
To obtain Hydra polyps bearing the bud anlage, we collected living animals using 
the following procedure. Newly detached small polyps (B bud generation, Fig. 1c) 
were collected within a defined time window (that is, every 6 h). These animals were 
cultured at a density of 1-1.3 hydracm * and fed and washed daily. Under these 
conditions, the detached daughter polyps start the formation of new buds (C bud 
generation, Fig. 1c) after 70-76 h. The budless Hydra polyps were treated with in- 
hibitors 24-48 h after detachment and were fixed for WISH analyses at 72 h after 
detachment. 

Gene isolation. Partial open reading frame (ORF) sequences of Hydra genes, in- 
cluding those encoding Tgf-B and Dan family proteins and transcription factors, 
were obtained by database searches on GenBank and Hydrazome. Gene-specific 
primers were designed to recover both 3’ and 5’ RACE (rapid amplification of CDNA 
ends) fragments using the GeneRacer kit (Invitrogen) with annealing temperatures 
between 60 °C and 65 °C. The RACE products were cloned using pGEM-T (Pro- 
mega) and sequenced. Overlapping 5’ and 3’ RACE fragments were aligned to obtain 
messenger RNA sequences. 

WISH and qPCR. WISH studies were performed as described previously”!*”. For 
qPCR experiments, total RNA was isolated from embryos with the use ofan RNeasy 
kit (QIAGEN) and reverse transcribed into cDNA. qPCR analysis was carried out 
using a DNA Engine Thermal Cycler equipped with a Chromo4 Real-Time Detec- 
tor (Bio-Rad). The qPCR primer sequences used to amplify H. magnipapillata cDNA 
are as follows: Ndr forward, 5'-GCACATCCGCTGACAGTTAC-3’; Ndr reverse, 
ACTCTGTTGGAACACAGCAAGG-3’; Pitx forward, 5'-CGAGCAGCCAAGC 
TTICTAATGT-3’; Pitx reverse, 5’-CAGCTGTGTATCCGCTCGTA-3’; Bral for- 
ward, 5'-TCACAGGTGGAAATACGTAAATGGA-3’; Bral reverse, 5'-ATCGG 
GCTTTTCATCCAATGTGT-3’. 

Inhibitor treatment. To examine the effect of Nodal and Activin inhibitors on bud 
induction, young budless animals (1-2 days after detachment) were treated with 
an increasing concentration of A-83-01 (Calbiochem), SB-505124 (Sigma), LY- 
364947 (Sigma) or SD-208 (Sigma). The data shown in Fig. 2e and Extended Data 
Fig. 9 are presented as the mean + s.e.m. of four experiments (nm = 82 dimethyl- 
sulphoxide (DMSO), n = 96 A-83-01, m = 83 SB-505124, n = 87 LY-364947 and 
n= 81 SD-208). In the experiment shown in Fig. 2f, 120 and 119 budless animals 
were used in the control (DMSO) and experimental (40 nM A-83-01) condition, 
respectively. The data are presented as the mean + s.e.m. of four experiments. To 
check the effect of A-83-01 on bud development, animals bearing buds at stage 3 
were treated with 40 nM A-83-01 (n = 71 DMSO and n = 66 A-83-01 in three in- 
dependent experiments). To induce expanded and radialized bud formation, budless 
animals were treated with 40 nM A-83-01 for 3 days and cultured for 1 day (Fig. 3a) 
or 3 days (Fig. 3b) without the inhibitor. The number of buds showing biradial, 
expanded or radial morphologies (n = 222 control and n = 324 A-83-01 in three 
independent experiments) and the number of tentacles per polyp (n = 60 control 
and n = 28 A-83-01 in two independent experiments) were scored. Animals were 
fed and washed each day in hydra medium containing the inhibitors. The GSK3 
inhibitor alsterpaullone (ALP) (Calbiochem) was used at 0.2 UM and 1 |.M. For the 
qPCR experiments shown in Fig. 4b, budless animals were treated with ALP for 
24h (n = 4). Animals were treated with 1-2.5 1M and 5 uM B-Catenin inhibitor 
iCRT14 (Sigma) to examine the effect on budding and Ndr and Pitx expression, 
respectively. Dose-dependent iCRT14 inhibition of bud initiation (n = 48 DMSO, 
n= 47 1M iCRT14 and n = 48 2.5 1M iCRT14) and bud development (n = 58 
DMSO, n = 59 1 uM iCRT14 and n = 60 2.5 UM iCRT 14) were examined in three 
independent experiments for each. In all experiments using chemical inhibitors, 
the control medium contained an equal concentration (less than 0.1% (v/v)) of DMSO. 
Treatment with recombinant H. magnipapillata Ndr protein. Recombinant His- 
tagged Ndr protein was expressed in Escherichia coli and purified, and the effect on 
bud formation was examined as described previously with minor modifications*’. 
For Ndr experiments, a piece of mid-gastric tissue (one eighth of the body column) 
was isolated from budless animals of transgenic H. vulgaris AEP in which green 


fluorescent protein (GFP) was expressed in ectodermal epithelial cells under the 
control of the Hydra Actin promoter and treated with BSA (control) or recom- 
binant Ndr protein for 3 hin hydra medium containing 20% DMEM (Sigma). After 
treatment, the tissue explants were transplanted into untreated budless polyps of 
the normal AEP strain (n = 58 for BSA and n = 58 for Ndr in four independent 
experiments). 

Overexpression and knockdown of H. magnipapillata Ndr and Pitx. For the 
expression plasmid, the ORF of Ndr was cloned into the vector pBSAA under the 
control of the Hydra Actin promoter*. For the control experiment, empty pBSAA 
or pBSAA-GFP vectors were used. The numbers of animals used was 432 for the 
controls and 434 for the Ndr studies in 12 independent experiments. The siRNAs 
specific for GFP (AAGATGGAAACATTCTTGGAC), Ndr (Ndr siRNA2, AAGG 
AAACAATATTTCTACGA; and Ndr siRNA3, TGGATTACATTTGAAGTAAC 
A) and Pitx (Pitx siRNA1, ATGTGGAAAATAAAGAAGATACG; and Pitx siRNA2, 
GAGTTGATAATCGCATAAGAAGA) were purchased from QIAGEN. Trans- 
fection of the plasmid or siRNAs was carried out as reported previously™ with 
minor modifications. As Ndr and Pitx are expressed in different germ layers (that 
is, in the ectoderm and endoderm, respectively), we used transgenic Hydra strains 
in which either the ectoderm or endoderm was labelled with GFP or red fluor- 
escent protein (RFP), respectively’. Budless animals were collected into a Petri 
dish from cultures fed every second day. The animals were washed five times with 
Milli-Q water and left for 30-60 min. Polyps (20-25) were placed into an electro- 
poration cuvette with a 4-mm gap (Bio-Rad), and as much water was removed as 
possible. After adding 200 il sterilized 10 mM HEPES (pH 7.0) containing 10 pig 
expression vector or 2 LM siRNA, the cuvettes were shaken by tapping ten times to 
evenly distribute the animals and the plasmid or siRNA. The cuvettes were left for 
5 min to allow the animals to relax and extend. A Bio-Rad Gene Pulser II electro- 
poration system equipped with a radio-frequency (RF) module was adjusted (for 
plasmids, 60 V, 1 burst, 50 ms burst duration; for siRNA, 50 V, 1 burst, 10 ms burst 
duration). Immediately after administering each pulse, 500 jl restoration medium 
containing 80% hydra medium and 20% hyperosmotic dissociation medium (6 mM 
CaCl, 1.2mM MgSO,, 3.6mM KCl, 12.5mM N-Tris-[hydroxymethyl]methyl-2 
aminoethanesulphonic acid, 6 mM sodium pyruvate, 6 mM sodium citrate, 6 mM 
glucose, and 50 mg ml~ rifampicin, pH 6.9) was added to the cuvette. Animals were 
then carefully transferred to Petri dishes (6-cm diameter) containing 10 ml restora- 
tion medium and incubated for 1 day to allow recovery. Viable polyps were sepa- 
rated from cell debris and transferred to new Petri dishes containing 100% hydra 
medium, cultured for 1 day and used for experiments. In the siRNA-mediated Ndr 
knockdown experiments, we performed six independent experiments (n = 139 for 
GFP-specific siRNA, n = 138 for Ndr siRNA2 and n = 136 for Ndr siRNA3). For 
Pitx knockdown, we performed three independent experiments (n = 36 for GFP- 
specific siRNA, n = 40 for Pitx siRNA1 and n = 35 for Pitx siRNA2). 

Statistics. The precise numbers of experimental animals are reported above. All 
statistical analyses were performed with a one-sided, unpaired Student’s t-test. The 
data are shown as the mean = s.e.m. No animal or sample was excluded from the 
analysis. Sample randomization and blinding were not applied, as the treatments 
used were not targeted at specific sites or organs of the animal. P < 0.05 was con- 
sidered to indicate statistical significance. 
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Extended Data Figure 1 | Phylogenomic distribution of Nodal-signalling (Extended Data Figs 2, 4-6). Genes labelled with pink evolved in Bilateria; 
genes. Rows, species. Columns, gene members involved in Nodal signalling; orange in Eumetazoa; yellow, in Metazoa; blue, in Unikonta. 
white asterisks indicate low statistical support in phylogenomic analyses 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


BMP2/4 


XlaDerriere 
XlaVg-1 


43/-/57 


CpaBmp2/4 


MmuGdf1 


DreVgr1 
MmuGef3 


Spul 


puUnivin 

L[-—— skoUnivin-tike 
BrlUnivin 
XtrGi 


63/49/71 


-DreGdiga 
ea ae 
NveGdf5-like 
‘AmiGdt5-like 
xing Oks 
rm -like 
a Mmusmp 10 
‘DreBmp10 
SkoBmp10-like 


BflBmps-8 
‘SkoBmp5-8 


94/90/99 


XlaBmp7 
68/61/86 —f MmuBmp7 
DreBm| 
e Nvebmpoes Hipatingé 
imaBmp5-8¢ 


Hmabi 


DreBmp3 
BflBmp3 
SkoBmp3-like 

SpuBmp3 
— XlaAdmp 
SON 
wAdm 
BA 
—Drendme: 
Bilt gt 


83/82/90 Oe 
rs L Aditgi 


71/55/84 
e 


97/93/99 


HmaBmp5-86 


mp5-8a, 


AmiTogft 


AmiNodal-related B 


HmaBmp2/4 


GDF1/3 


GDF5/6/7 


BMP10 


BMP5/6/7/8 


BMP3 


ADMP 


NODAL 


AmiNodal-related A 


BfiNodal-related 


CpaNodal-related 
OcaTafi 


Amilgi4 
— aAditot4 


MleNodal-related 


HmaNodal-related 


OcaTgf2 


T AdiNodal-related A 
NODAL-RELATED 


-—————-1 


LgiTgf1 
LgiTgf2 giTaF 


HmaActivin-like4 


Hmadctivin-like3 


HmadActivin-like2 


43/70/70 


56/65/75 


MmulnhibinBA 


XirGdf8-like 
MmuGats. 
reGdfe 


OcaActivin-like 


HmaActivin-like1 
SkoActivin-like 
1 Briaetivin 


AmiActivint 
AdiActivin4 


AmiActivin2 
AAS Ne 


NveActivin 
ngeagivin 
XlainhibinBS. 
MmulnhibinBB 
DreinhibinBA 
LgiMyostatin 


SkoGat6/11-like 


ACTIVIN 


Xlalnhibin6C 


DmeMyoglianin 


GDF8/MYOSTATIIN 


Extended Data Figure 2 | Maximum likelihood phylogeny of the metazoan 
Tgt-f gene superfamily. H. magnipapillata (Hma) and Cladonema pacificum 
(Cpa) genes are shown in red, and Nematostella vectensis (Nve), Acropora 
digitifera (Adi) and Acropora millepora (Ami) genes are shown in blue. The 
numbers at the branches indicate the bootstrap supports from 200 BioNJ, 200 
IQPNNI and 100 PhyML bootstrap trees, respectively. There was reasonable 
support for the cnidarian members of the Bmp2/4, Bmp5-8, Gdf5-7 and 
Gdf8 (Myostatin) gene families. There are no cnidarian members of the Bmp3, 
Bmp10, Gdfl and Gdf3, and Lefty gene families. Anthozoan Activin genes 
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cluster only with low support with bilaterian Activin genes. Similarly, 
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Nodal-related sequences from A. millepora, A. digitifera, Branchiostoma 
floridae, C. pacificum, H. magnipapillata and Mnemiopsis leidyi cluster with the 
bilaterian Nodal genes. For these pre-bilaterian Nodal-related genes, two 
scenarios are possible. One scenario is that these Ndr genes are pre-bilaterian 
Tgf-B-based inventions that acquired the Nodal function (as has been shown 
here for H. magnipapillata). The second scenario is that these Ndr sequences 
are orthologous to the bilaterian Nodal genes; however, as the branches in 
the backbone of the tree are deep, it is not possible to verify this possibility with 
sufficient support, owing to saturation and noise. 
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Extended Data Figure 3 | Tgf-B superfamily protein structure and motifs. 
a, Conserved protein domains of bilaterian Tgf-B proteins based on consensus 
sequences of the different Tgf-f families, indicating the conserved signal 
peptide (black) and B-propeller motif (dark violet) of the prodomain (light 
violet), the furin cleavage sites (blue) of the variable region (green) and the 
ligand domain (amber) with the conserved cysteine residues. Structurally, 
bilaterian ligands of the Tgf-f superfamily fall into two groups that can be 
distinguished by their pattern of cysteine residues. The Activin and Tgf-B, Gdf8 
and Gdf11, and Lefty ligands have a conserved C/CC/CXXXC/CC/CXC 
pattern. Bmp-like ligands, including Nodal, exhibit a distinct C/CXXXC/[C]C/ 
CXC pattern, but only Nodal signals via Alk4/7 receptors, Smad2/3 and Pitx. 


ds D SUK PSH 1A -AGRD FORT V YRDIEASSeR 


Note that X denotes any amino acid except for cysteine, and [C] denotes a 
cysteine residue that is conserved in many but not all sequences. The solidus (/) 
represents stretches of amino acids separating the conserved cysteine patterns. 
Proteins encoded by cnidarian Ndr and bilaterian Nodal genes also have a 
conserved motif pattern in the cleavage sites in the variable linker region. 

b, Alignment of bilaterian Nodal and pre-bilaterian Nodal-related sequences. 
The boxes in violet, blue and red indicate the conserved patterns of the 
B-propeller motif, the furin cleavage sites and the cysteine residues, 
respectively. Note that several sequences have additional furin cleavage sites 
that are not conserved throughout the Nodal subfamily and a change in the 
cysteine pattern in some vertebrates (CC at 184-185 to CXXC at 182-184). 
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Extended Data Figure 4 | Maximum likelihood phylogeny of the metazoan (Cpa) genes are shown in red, and N. vectensis and Acropora sp. genes are 


Cerberus, Dan and Gremlin, and Lhx and Lim, transcription factor gene shown in blue. The numbers at the branches indicate the bootstrap supports 
families. a, Cerberus, Dan and Gremlin gene families. b, Lhx and Lim from 100 BioNJ, 100 IQPNNI and 100 PhyML bootstrap trees, respectively. 


transcription factor gene families. H. magnipapillata (Hma) and C. pacificum 
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Extended Data Figure 5 | Maximum likelihood phylogeny of the metazoan 


Smad gene family. H. magnipapillata (Hma) and C. pacificum (Cpa) genes are 
shown in red, and N. vectensis and Acropora sp. genes are shown in blue. 


The numbers at the branches indicate the bootstrap supports from 100 BioNJ, 
100 IQPNNI and 500 PhyML bootstrap trees, respectively. 
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Extended Data Figure 6 | Maximum likelihood phylogeny of the metazoan 
Prd gene superfamily of transcription factors. H. magnipapillata (Hma) 
and C. pacificum (Cpa) genes are shown in red, and N. vectensis and Acropora 
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sp. genes are shown in blue. The numbers at the branches indicate the bootstrap 
supports from 100 BioNJ, 100 IQPNNI and 100 PhyML bootstrap trees, 
respectively. 
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Extended Data Figure 7 | Expression patterns of Nodal-signalling-related 
genes in H. magnipapillata. a, H. magnipapillata Bmp genes exhibit a similar 
pattern of expression in endodermal cells at the oral side. All Activin genes 
show expression in the endodermal layer in the upper half of the body column, 
except for the head region. ActL2 shows a broader expression area than the 
other Activin genes do along the body length towards the foot region. Cerberus- 
like1 (CerL1), CerL2a and CerL2b are expressed at a high level in the 
endodermal layer at the base of the tentacles in mature Hydra. CerL4 was 
expressed in specific endodermal cells at the tip of the tentacles. Gremlin-like 
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(GreL) showed a ubiquitous expression pattern. The Nodal target genes Pitx 
and FoxA were expressed in the endoderm. Prominent expression was observed 
at the evaginating bud tip and head region (Pitx) or the base of the tentacles 
(FoxA). Lhx1/5 showed clear expression at the mature and developing oral tip 
and peduncle. b, At the presumptive bud region (stage 1), Pitx expression was 
clearly demonstrated, but the other genes were under the detection limit by 
WISH. The first detectable expression of the ActL1, CerL1, CerL2b and FoxA 
genes appeared at budding stage 3. 
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Extended Data Figure 8 | Ectopic overexpression of Ndr did not induce 
ectopic bud formation. a, Budless animals were electroporated with pBSAA- 
GFP (control) or pBSAA-Nadr vectors. After a 2-day incubation in hydra 
medium, cells expressing GFP or H. magnipapillata Ndr or Pitx genes were 
visualized by WISH. Ectopic overexpression of Ndr resulted in a slight increase 
in Pitx expression along the main body axis, but a new head organizer with 


Bi Gfp-expressing cells 
B HmWNar-expressing cells 


Section 


strong Pitx expression was established only in the budding zone. 

b, Quantification of transfected cells demonstrated an almost equal distribution 
of GFP- or Ndr-expressing cells along the oral-aboral axis. This finding 
indicates that the whole body column region has a similar capacity to be 


transfected with an Ndr-expressing plasmid and to express an exogenous 
Ndr gene. 
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Extended Data Figure 9 | Dose-dependent effect of inhibitors of Alk4/5/7 
and of B-Catenin. a-d, Budless animals were incubated in the absence or 
presence of specific inhibitors of Alk4/5/7: A-83-01, SB-505124, LY-364947 
and SD-208. The data are presented as the mean + s.e.m. of four independent 
experiments. e, Hydra polyps treated with 1 1M ALP (GSK3 inhibitor) for 16h 
with or without 1 or 5 uM iCRT14 (B-Catenin inhibitor). The relative 
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expression level of Pitx was examined by qPCR. The data, which were 
normalized to the internal standard EF1«, are presented as the ratio of the fold 
increase compared with the non-treated control. The data are presented as 
the mean + s.e.m. (triplicate determinations), with similar results obtained in 
three independent experiments. *, P < 0.05; one-sided t-test. 
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Extended Data Figure 10 | Functions of H. magnipapillata Ndr and Pitx in 
bud initiation. Gene-specific siRNAs were co-transfected with a GFP-specific 
siRNA, and bud formation from the side of transfection (GFP-depleted side) 
was monitored. In the mock-transfected hydra, GFP fluorescence remained 
intact after electroporation. As Ndr and Pitx are expressed in different germ 
layers (that is, in the ectoderm and endoderm, respectively), we used transgenic 
Hydra strains in which either the ectoderm or endoderm is labelled with GFP or 
red fluorescent protein (RFP), respectively*’. Bud formation from the GFP- 
depleted side was significantly suppressed by co-transfection with an Ndr- 
specific siRNA (a) or a Pitx-specific siRNA (b). Note that the RFP fluorescence 
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was not affected, indicating that the decrease in the GFP fluorescence was 
not due to tissue damage caused by electroporation. The siRNAs were 
electroporated more effectively into the ectoderm than the endoderm, because 
the GFP-positive region of the endoderm remained more expanded at the 
non-transfected side (a, b). Bud formation from the side of the GFP-positive 
endoderm was not affected in Pitx-specific siRNA transfectants (c, d), 
indicating that localized expression and function of Pitx at the budding region is 
required. The data are presented as the mean + s.e.m. of three independent 
experiments. 
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Sensory-evoked LTP driven by dendritic plateau 


potentials in vivo 


Frédéric Gambino'*+, Stéphane Pagés'*, Vassilis Kehayas'”, Daniela Baptista’, Roberta Tatti»*, Alan Carleton! 


& Anthony Holtmaat! 


Long-term synaptic potentiation (LTP) is thought to be a key pro- 
cess in cortical synaptic network plasticity and memory formation’. 
Hebbian forms of LTP depend on strong postsynaptic depolarization, 
which in many models is generated by action potentials that propa- 
gate back from the soma into dendrites”*. However, local dendritic 
depolarization has been shown to mediate these forms of LTP as well’*». 
As pyramidal cells in supragranular layers of the somatosensory cor- 
tex spike infrequently®*, it is unclear which of the two mechanisms 
prevails for those cells in vivo. Using whole-cell recordings in the mouse 
somatosensory cortex in vivo, we demonstrate that rhythmic sensory 
whisker stimulation efficiently induces synaptic LTP in layer 2/3 (L2/3) 
pyramidal cells in the absence of somatic spikes. The induction of 
LTP depended on the occurrence of NMDAR (N-methyl-p-aspartate 
receptor)-mediated long-lasting depolarizations, which bear simi- 
larities to dendritic plateau potentials”-**. In addition, we show that 
whisker stimuli recruit synaptic networks that originate from the 
posteromedial complex of the thalamus (POm). Photostimulation 
of channelrhodopsin-2 expressing POm neurons generated NUDAR- 
mediated plateau potentials, whereas the inhibition of POm activity 
during rhythmic whisker stimulation suppressed the generation of 
those potentials and prevented whisker-evoked LTP. Taken together, 
our data provide evidence for sensory-driven synaptic LTP in vivo, in 
the absence of somatic spiking. Instead, LTP is mediated by plateau 
potentials that are generated through the cooperative activity oflem- 
niscal and paralemniscal synaptic circuitry'***. 

In most cortical synaptic LTP studies in vivo, strong postsynaptic depo- 
larization was provided by action-potential-triggering somatic current 
injections’. To examine whether sensory stimuli can elicit LTP of syn- 
aptic inputs on L2/3 pyramidal cells in the mouse somatosensory cortex 
without the help of artificially triggered backpropagating action poten- 
tials, we recorded sensory-evoked postsynaptic potentials (PSPs) in the 
barrel cortex in vivo, and applied a rhythmic whisker stimulation (RWS) 
protocol that has been shown to enhance whisker-evoked local field 
potentials’. Whole-cell patch recordings were targeted to cells above 
the C2 barrel of urethane-anaesthetized mice®"* (Fig. 1a). Using a pie- 
zoelectric actuator, the principal whisker was deflected back and forth 
(100-ms deflections) for 1 min at a frequency of 8 Hz, which is within 
the range of frequencies at which mice sample objects’*. In all cells, RWS 
evoked a sustained subthreshold depolarization (Fig. 1a), which was rem- 
iniscent of an evoked, NNUDAR-dependent, cortical upstate’’*°. None 
of the recorded cells displayed somatic action potentials during RWS. 
On average, RWS elicited a significant potentiation of subsequent sin- 
gle (0.1 Hz) whisker-deflection-evoked short-latency PSP amplitudes 
(PSPs hort} Fig. 1b, c; values for Figs 1-4 are provided in Supplementary 
Information). This LTP lasted for as long as the cells could be recorded 
from (at least 15 min after RWS), and correlated with the decay time of 
the sustained depolarization (Extended Data Fig. 1a, b). Pharmacolo- 
gical or hyperpolarization-mediated suppression of NMDAR conduc- 
tance specifically attenuated the RWS-evoked sustained depolarization, 


and prevented LTP (Fig. 1d-i, Extended Data Fig. 1 and Supplementary 
Note 1). RWS-induced LTP remained specific to the rhythmically stim- 
ulated synaptic pathway (Extended Data Fig. 2 and Supplementary Note 2), 
increased gradually over a timescale of minutes, and was occluded upon 
asecond RWS 10 min after the first stimulation (Extended Data Fig. 3a, b). 
RWS and hyperpolarization did not significantly alter intrinsic cell mem- 
brane properties, and small changes thereof did not correlate with the 
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Figure 1 | Rhythmic whisker stimulation induces LTP in L2/3 neurons. 

a, Top, schematic of recordings in L2/3 cells in vivo. PSPs and RWS are evoked 
by the principal whisker (wC2). Bottom, example trace of sustained 
depolarization induced by RWS (8 Hz for 1 min; black bar). b, Single-cell 
whisker-evoked short latency PSP amplitudes (PSPshort3 see ¢) before and 
after RWS. c, Top, single-cell averaged traces, pre and post RWS. Grey box, 
PSP.hort Window. Bottom, mean amplitudes, pre and post RWS (n = 11, 

**P = 0.008). d, Cumulative RWS-induced depolarization. e, Mean cumulative 
depolarization at the end of RWS and amplitudes of the first PSP.yo1 upon 
RWS (control, n = 11; +DAP5, n = 7; P = 0.028 (left) and P = 0.703 (right)). 
f, Single-cell whisker-evoked PSP.,or+ amplitudes upon epidural DAP5 (1 mM). 
g, Top, single-cell averaged traces, pre and post RWS under DAPS. Bottom, 
mean amplitudes, pre and post RWS under DAP5 (n = 7, P = 0.344). h, Mean 
PSPshort amplitudes without RWS, and pre and post RWS in controls and under 
DAPS5. i, Mean amplitudes normalized to baseline (RWS, n = 11; RWS 
+DAP5, n = 7; Control, n = 7; P< 0.009, one-way ANOVA; *P< 0.05, 
post-hoc comparisons versus RWS control). Error bars, s.e.m; square pulse 
lines, whisker deflections (100 ms); grey lines between bars, pairs. 
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Figure 2 | Whisker deflections evoke NUDAR-mediated plateau potentials. 
a, Mean whisker-evoked 100-ms PSP integrals in controls (grey, n = 7), and 
upon epidural DAP5 (blue, n = 5). b, Single-cell averaged traces (n = 40 trials). 
NMDAR-mediated plateau: DAP5-trace subtracted from control 

trace. c, Mean Indexpjateau (see Extended Data Fig. 4) of same cells as in a. 

d, Mean PSP integrals and plateau probabilities (n = 5, **P = 0.004 (left) and 
**P = 0.002 (right)). e, Single-cell examples of responses at two different 
holding potentials (grey, single trials; blue, averaged traces). f, Top, averaged 
traces from e. Bottom, traces normalized to PSP. nor; in order to compensate for 
increased driving force, showing that hyperpolarization blocks the plateau. 

g, Mean PSP integrals and plateau probabilities (n = 11, ***P < 0.001). 

h, Single-cell averaged traces under control (dark blue) and various NUDAR 
conductance-reducing conditions. i, Comparison of mean plateau potential 
probabilities and integrals under various conditions (control, n = 44; 10 uM 
DAPS, n = 9; 1 mM DAP5, n = 12; iMK801, n = 10; Hyper, n = 11; P< 0.001 
(top) and P = 0.007 (bottom), one-way ANOVA; *P < 0.05, post-hoc 
comparisons versus control). j, Single-cell PSPsnorr amplitudes recorded at 
resting membrane potential (—70 mV) before and after RWS. The cell was 
hyperpolarized (— 100 mV) during RWS. k, Top, single-cell averaged traces, pre 
and post RWS upon hyperpolarization. Grey box, PSP nor Window. Bottom, 
mean amplitude, pre and post RWS recorded at —70 mV, but upon 
hyperpolarization during RWS (n = 9, P = 0.993). 1, Normalized plateau 
strength predicts the level of RWS-induced LTP in controls (nonlinear 
regression; R? = 0.76, P= 0.002). DAPS5 blocks plateaus and LTP. Neurons 
with high plateau strengths fail to potentiate when hyperpolarized during RWS 
(PSP hort, Mean: 103.6 + 6%, n = 9). Error bars, s.e.m; square pulse lines, 
whisker deflections (100 ms); grey lines between bars, pairs. 


magnitude of LTP (Extended Data Fig. 3c—n). Together, the data indi- 
cate that RWS elicits bona fide synaptic LTP by evoking a cell autono- 
mous and sustained NMUDAR-dependent subthreshold depolarization. 

We next investigated what may cause the sustained subthreshold depo- 
larization upon RWS. In accordance with previous reports single whisker 
deflections typically evoked compound PSPs, containing short- and long- 
latency components, which is similar to the sparse, temporal, spiking pro- 
files of L2/3 cells (Fig. 1c and Extended Data Fig. 4a, b)**"”*. Short-latency 
PSPs were always present. Long-latency PSPs occurred with variable prob- 
abilities in different cells. They had an all-or-none, nonlinear appearance 
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Figure 3 | Plateau potentials depend on POm activity. a, b, POm neurons 
were transfected with adeno-associated virus (AAV)-GCaMP5G and their 
axons were imaged in L1 of the barrel cortex (b). c, Example traces of Ca** 
signals (AF/F) in POm axonal boutons upon whisker deflection bouts (5 at 
20 Hz; grey lines). d, Example of a responsive (blue arrowhead) and an 
unresponsive (red arrowhead) axon. Scale bar, 5 tum. e, Top, whisker-evoked 
(dashed line) Ca?* signals (AF/Fo) in the boutons of d (rows, 10 successive 
trials). Bottom, mean whisker-evoked Ca?* signals in the two boutons. 

f, Photostimulation of ChR2-expressing POm neurons using an optical fibre, 
and ChR2 expression profiles in the thalamus (inset) and cortex. g, Single-cell 
examples of photostimulus-evoked PSPs (in L2/3 cells) before (left) and 

after DAP5 (right). Grey, single trials; blue, averaged traces; black bar, 
photostimulus. h, Photostimulus-evoked PSP integrals as a function of stimulus 
duration (normalized to the maximal integral upon a 100-ms pulse +DAP5 
(control, n = 7; DAP5, n = 5; **P = 0.002)). i, Averaged 100-ms light-pulse- 
evoked PSPs from g. NMDA-plateau: DAP5-trace subtracted from control 
trace. j, Comparison of 100-ms light pulse-evoked and whisker-evoked PSPs 
(whisker, n = 33; light, n = 11; P = 0.918 (top); **P < 0.01 (bottom)). 

k, Coronal brain section with fluorescent muscimol in the POm. I, Single-cell 
averaged traces of 100-ms whisker-evoked PSPs in controls and after POm 
inactivation, with and without DAP5. m, Effects of POm inactivation on 
PSPshort (control, n = 33; muscimol, n = 9; P = 0.095), plateau potential 
probabilities (control, n = 44; muscimol, n = 9; ***P < 0.001, Mann-Whitney 
U-test), and PSP integrals (over 300 ms; control, n = 33; muscimol, n = 9; 
100 ms light, n = 13; P = 0.002, one-way ANOVA, and *P < 0.05, post-hoc 
comparisons versus control). Error bars represent s.e.m. 


and were selectively attenuated upon an NMDAR block (Fig. 2a-d and 
Extended Data Fig. 4a—g). This indicates that long-latency PSPs share 
similarities with plateau potentials”. We termed them accordingly. 
Importantly, plateau potentials, but not short-latency PSPs, were also 
significantly attenuated upon a cell-autonomous suppression of NUDAR 
conductance using artificial hyperpolarization (— 100 mV), or intracel- 
lular MK801 (1 mM) (Fig. 2e-i and Extended Data Fig. 4g). Conversely, 
when holding the neurons ata slightly depolarized state, or in some occa- 
sions at resting states, these potentials evoked spikes (Fig. 2e). This sug- 
gests that whisker-evoked long-latency spiking under normal conditions 
is associated with the occurrence of NNUDAR-dependent plateau poten- 
tials (Extended Data Fig. 4b)'*!*?!**, The occurrence of plateau poten- 
tials was not critically dependent on the 100-ms duration of the whisker 
deflections (Extended Data Fig. 4h-j). 
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Figure 4 | RWS-evoked LTP depends on POm-mediated plateau potentials. 
a, Example of POm axonal bouton Gat dynamics (AF/Fy) upon RWS. b, Ca2t 
signals in POm boutons upon whisker-deflection bouts (grey lines) and 
RWS. c, Mean RWS-induced Ca** signals in the 25 most active (blue) and 
least active (red) boutons (see Extended Data Fig. 6). Shaded area, s.d. 

d, Integrated responses before, during and after RWS (n = 765; ***P < 0.001). 
e, Example of the membrane depolarization (L2/3 cell) upon rhythmic 
photostimulation of POm (20 flashes at 8 Hz) or RWS (20 stimuli at 8 Hz). 
Black bar, stimulus period. f, RWS-induced membrane depolarization upon 
muscimol-POm inactivation or DAP5. g, Mean integrated membrane 
potentials (over 2.5 s) (control, n = 14; muscimol, n = 4; DAP5, n = 15; 

light 8 Hz, n = 5; P= 0.01, one-way ANOVA; *P < 0.05 and **P < 0.01, 
post-hoc comparisons versus control). h, Single-cell whisker-evoked PSP snort 
amplitudes before and after RWS upon muscimol-mediated POm inactivation. 
i, Top, single-cell averaged traces, pre and post RWS. Grey line, PSP short 
window. Square pulse line, whisker deflection (100 ms). Bottom, mean 
amplitudes pre and post RWS upon POm inactivation (n = 9, P = 0.972). Grey 
lines between bars, pairs. j, Comparison of normalized PSP.,o; amplitudes 
upon RWS in controls, and after a POm or NMDAR block (control, n = 11; 
muscimol, n = 9; DAP5, n = 7; Hyper, n = 9; P = 0.015, one-way ANOVA, 
and *P < 0.05, post-hoc comparisons versus control). Error bars, s.e.m. 


NMDAR-mediated plateau potentials or summating NMDA spikes 
have been observed in various cell types in the somatosensory cortex in 
vivo'®'*?, where they are characterized by local and spreading dendritic 
Ca’* transients. We confirmed that under our conditions too, single whis- 
ker deflections evoke Ca?* transients in spines and dendritic shafts, both 
under wakefulness (data not shown) and anaesthesia. Local and large- 
scale events occurred at various positions in the dendritic tree (Extended 
Data Fig. 5a—e and Supplementary Note 3). These responses were dimin- 
ished upon an NMDAR block, and increased upon RWS (Extended Data 
Fig. 5f-j), similar to the plateau potentials and sustained depolariza- 
tion in our whole-cell recordings. Together, this indicates that single- 
whisker-deflection-mediated plateaus and the RWS-mediated sustained 
depolarization in our recordings are likely to be supported by NMDAR- 
mediated dendritic Ca”* events!1"9, 

The plateau strength of neurons (that is, the product of the probability 
for a whisker deflection to elicit a plateau potential and the average inte- 
grated depolarization) correlated with the magnitude of RWS-induced 
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LTP (Fig. 21). Neurons bearing high plateau strengths could not be poten- 
tiated when they were hyperpolarized during RWS (Fig. 2j-l). This strongly 
suggests that the plateau potentials are essential for RWS-induced LTP. 

Next, we questioned what could be the synaptic source of NNUDAR- 
mediated plateau potentials. Previous studies indicate that in the somato- 
sensory cortex they are generated by coincident activity of segregated 
excitatory synaptic pathways'®''*. We reasoned that in anaesthetized 
mice plateau potentials may depend on co-activation of intracolumnar 
lemniscal and thalamocortical paralemniscal pathways'®"'. Paralemnis- 
cal input to L2/3 cells may be directly or indirectly provided by thal- 
amic POm efferents that target pyramidal cell dendrites in L5A and 
L1 (refs 14-16, 26-29). This system is involved in the kinematics of 
whisking’* and may provide feedback information to pyramidal cells’’. 

To test whether POm efferents are activated by passive whisker stim- 
uli!®?8°, we expressed the genetically encoded Ca’ * indicator GCaMP5G 
in POm neurons (Fig. 3a, b), and imaged responses in their projections 
to somatosensory cortex L1 through a cranial window. Whisker deflec- 
tions evoked Ca” * transients with various latencies in a substantial por- 
tion of the terminals in awake and anaesthetized mice (Fig. 3c—-e and 
Extended Data Fig. 6). The shortest response times matched the laten- 
cies of the plateau potentials. Furthermore, the pattern of activation 
was widespread and did not remain limited to the whisker’s home barrel 
column (Extended Data Fig. 7a—c and Supplementary Note 4). Interest- 
ingly, and in contrast to short-latency PSPs, the plateau potentials were 
not selective for the principal whisker, which supports the possibility 
that the POm thalamocortical circuitry is indeed involved in generat- 
ing them (Extended Data Fig. 7d, e). 

To test this directly we expressed the recombinant light-gated ion chan- 
nel channelrhodopsin-2-Venus (ChR2-Venus) in the POm and recorded 
photostimulus-evoked PSPs in L2/3 cells (Fig. 3f, g and Extended Data 
Fig. 8a-c and Supplementary Note 5). POm neurons were stimulated 
using an optical fibre that was guided by a stereotactically implanted 
cannula (Fig. 3f). Short square light pulses (10-20 ms) produced small- 
amplitude PSPs, longer pulses (50-100 ms) produced plateaus (Fig. 3g—h). 
This indicates that a strong activation of POm thalamic nuclei may gen- 
erate plateau potentials by itself, through monosynaptic inputs (Extended 
Data Fig. 8d, e and Supplementary note 5) or, more probably, through 
the large-scale recruitment of cortical paralemniscal synaptic networks 
that project to L2/3 cells'*’’””*. This suggests that the POm-associated 
synaptic circuitry mediates whisker-evoked plateau potentials. Indeed, 
analogous to whisker-evoked plateaus, photostimulus-evoked plateaus 
were eliminated upon an NMDAR block (Fig. 3g, h). The full width at 
half maximum of the NMUDAR-mediated plateaus was equal between 
whisker and 100-ms light stimuli (Fig. 3i, j; for example, compare with 
Fig. 2b). Importantly, for these photostimuli the peak amplitude of the 
short-latency PSPs was significantly lower than that of whisker-evoked 
PSPs (Fig. 3j), confirming that photostimuli did not generate plateau 
potentials through a large-scale recruitment of lemniscal synaptic pathways. 

To test further the role of this paralemniscal circuitry, we specifically 
suppressed POm activity using the GABA-A-R selective agonist mus- 
cimol (Fig. 3k, Extended Data Fig. 9a and Supplementary Note 6). This 
did not affect whisker-evoked short-latency PSPs, but greatly reduced 
whisker-evoked plateau potential probabilities (Fig. 31, m). The inhibition 
of POm activity was sufficient to eliminate most of the evoked NMUDAR- 
mediated plateaus, since an additional NMDAR block did not further 
reduce them (Fig. 31). 

The role of the POm in the generation of plateau potentials suggests 
that it is also involved in the production of the RWS-evoked sustained 
depolarization. Indeed, RWS evoked sustained Ca’ * transients ina portion 
of POm-derived axonal boutons (Fig. 4a—d and Extended Data Fig. 6e, f). 
Rhythmic photostimulation of ChR2-expressing POm neurons evoked 
a sustained depolarization similar to RWS (Fig. 4e, g). Conversely, RWS 
failed to evoke a sustained depolarization upon muscimol-mediated sup- 
pression of POm activity, similar to the effect of DAP5 (Fig. 4f, g). This 
suggests that POm activity evokes multiple plateaus during RWS. To test 
the causal relationship between plateau potentials, RWS-evoked sustained 
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depolarization, and RWS-induced LTP without blocking NMDARs, 
we applied RWS to muscimol-injected mice. In these mice RWS failed 
to induce LTP (Fig. 4h, i), similar to the effect of suppressing NUDAR 
conductance (Fig. 4j). This strongly suggests that plateau potentials, 
driven by the co-activation of POm-associated synaptic circuitry facil- 
itate whisker-evoked LTP. 

In summary, our data provide evidence for sensory-evoked LTP that 
is independent of somatic spikes, displaying similarities to in vitro exper- 
iments in hippocampus”. In vivo, the occurrence of LTP in the absence 
of somatic spikes may be an important mechanism to strengthen syn- 
apses between weakly connected neurons, without the necessity for sen- 
sory inputs to first elicit a sufficient number of action potentials. Since 
L2/3 neurons in the somatosensory cortex normally spike sparsely or 
infrequently®, this mechanism may also prevent neurons from losing 
synaptic input due to spike-timing-dependent long-term depression 
(LTD)-like processes’. 

Wealso found that sensory-evoked LTP may depend on paralemniscal 
synaptic inputs that originate from the POm of the thalamus. This cir- 
cuitry provides contextual or predictive information for external sensory 
stimuli that feed forward through lemniscal pathways'®. Our data sug- 
gest that the repeated coincident activity of this feedback circuitry may 
increase L2/3 neurons’ sensitivity to future sensory stimuli. It is likely 
that during wakefulness dendritic plateau potentials that are mediated 
by inputs from motor cortex" also function to facilitate this—perhaps 
Hebbian—form of LTP (Supplementary Note 7). 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


All procedures were carried out in accordance with protocols approved by the ethics 
committee of the University of Geneva and the authorities of the Canton of Geneva. 
Surgery and intrinsic optical imaging. Three- to four-week-old male C57BL/6 mice 
were used. Anaesthesia was induced using isoflurane (4% with ~0.5 1 min ! O,) 
and then continued using an intraperitoneal (ip.) injection of urethane (1.5 gkg ', 
in lactated ringer solution containing (in mM) 102 NaCl, 28 Na-L-lactate, 4 KCl, 
1.5 CaCl,). Body temperature was maintained at 37 °C by a feedback-controlled 
heating pad (FHC). Eye ointment was applied to prevent dehydration. In accordance 
with Swiss Federal laws, analgesia was provided by local application of lidocaine (1%) 
and ip. injection of buprenorphine (Temgesic, 0.05 mg kg” '). The skin was dis- 
infected with ethanol 70% and betadine, and a custom-made plastic chamber was 
attached to the skull above the barrel cortex using dental acrylic and dental cement 
(Jet Repair Acrylic, Lang Dental Manufacturing). The chamber was filled with sterile 
cortex buffer (in mM: 125 NaCl, 5 KCI, 10 glucose, 10 HEPES, 2 CaCl, and 2 MgSO, 
(pH 7.4)) and sealed with a glass coverslip. 

Intrinsic optical signals were imaged as described previously'’, through the intact 

skull using a light guide system with a 700-nm (bandwidth of 20 nm) interference 
filter and a stable 100-W halogen light source. An image of the surface vascular 
pattern was taken using green light (546-nm interference filter) at the end of each 
imaging session. Images were acquired using the Imager 3001F (Optical Imaging, 
Mountainside, NJ) equipped with a large spatial 256 X 256 array, fast readout, and 
low read noise charge-coupled device (CCD) camera. The size of imaged area was 
adjusted by using a combination of two lenses with different focal distances (Nikon 
50 mm, bottom lens, 135 mm, upper lens, f2.0; total magnification 2.7). The CCD 
camera was focused on a plane 300 jim below the skull surface. Responses were visu- 
alized by dividing the stimulus signal by the baseline signal, using the built-in Imager 
3001F analysis program (Optical Imaging, Mountainside, NJ). Signals were ana- 
lysed further using a custom routine in Matlab as described previously"*. 
In vivo whole-cell recordings. After imaging, adequate anaesthesia was assessed 
(absence of toe pinch reflexes, corneal reflexes, and vibrissae movements) and pro- 
longed by supplementary urethane (0.15 g kg” ', ip.) ifnecessary. A small ~1 X 1mm 
craniotomy (centred above the C2 whisker maximum intrinsic optical signal res- 
ponse) was made using a pneumatic dental drill. The dura was left intact. Whole- 
cell patch-clamp recordings of L2/3 pyramidal neurons were obtained as described 
previously'*°". High positive pressure (200-300 mbar) was applied to the pipette 
(5-8 MQ) to prevent tip occlusion. After passing the pia the positive pressure was 
immediately reduced to prevent cortical damage. The pipette was then advanced in 
1-11m steps, and pipette resistance was monitored in the conventional voltage clamp 
configuration. When the pipette resistance suddenly increased, positive pressure 
was relieved to obtain a 3-5-GQ seal. After break-in, Vm was measured, and dial- 
ysis was allowed to occur for at least 5 min before deflecting the whisker. Data were 
acquired using a Multiclamp 700B Amplifier (Molecular Devices), and digitized at 
10 kHz (National Instruments), using Matlab-based Ephus software (http://research. 
janelia.org/labs/display/ephus; The Janelia Farm Research Center). Offline analysis 
was performed using custom routines written in IGOR Pro (WaveMetrics). 

Current-clamp recordings were made using a potassium-based internal solution 
(in mM: 135 potassium gluconate, 4 KCl, 10 HEPES, 10 Na2-phosphocreatine, 
4 Mg-ATP, 0.3 Na-GTP and 25 tM AlexaFluor 488 hydrazide (Invitrogen), pH 
adjusted to 7.25 with KOH, 285 mOsM). Series and input resistance were moni- 
tored with a 100-ms long-lasting hyperpolarizing square pulse 400 ms before each 
whisker deflection and extracted offline by using a double-exponential fit. Record- 
ings were discarded ifthe change in these parameters was larger than 30%. The bridge 
was usually not balanced, and liquid junction potential was not corrected. Traces 
were analysed as described previously'*. For the action-potential analysis in Extended 
Data Fig. 3, action potentials were time-aligned to their respective threshold. The 
action-potential threshold was computed as the minimal membrane potential value 
at the time corresponding to the peak of the third derivative of the membrane poten- 
tial, similar to methods described previously”. 

PSPs were evoked by back and forth deflection of the whisker (100 ms, 0.133 Hz) 
using a glass capillary 4 mm away from the skin attached to a piezoelectric ceramic 
actuators (PL-series PICMA, Physik Instrumente). The voltage applied to the actu- 
ator was set to evoke a whisker displacement of 0.6 mm with a ramp of 7-8 ms (for 
details see ref. 18). The C1 and C2 whiskers were independently deflected by dif- 
ferent piezoelectric elements. 

DAP5 (1 mM or 10 1M, Tocris) was topically applied to the dura mater, either 
prior or during the whole-cell recordings. Epidural application of 0.1-1 mM DAP5 
has been shown to leave spiking in the thalamus intact and only minimally impact 
cortical L4 after 6 h of superfusion; 10 |1M is entirely ineffective in suppressing spikes 
in L4 (ref. 24). MK-801 (1 mM, Tocris) was included in the internal solution. After 
break-in we waited for at least 5 min to let MK-801 diffuse into the cells. Typically, 
the effects of MK-801 became visible over the time course of 4-5 min, and remained 


stable thereafter. This indicates that the effects were largely due to cell autonomous 
blockage of NMDARs"”. 
Injection of fluorescent muscimol into the POm. Mice were anaesthetized as 
described above. Analgesia was provided by local application of lidocaine and an 
ip. injection of buprenorphine. A burr hole was made to stereotactically inject fluo- 
rescent muscimol (Bodipy-TMR-X, 500 1M in cortex buffer with 5% DMSO, Invi- 
trogen) in the POm. The caudal sector of the POm that mainly projects to L1 of 
S1 (ref. 33) was specifically targeted using the following stereotaxic coordinates: 
rostrocaudal (RC), —2.00 mm; mediolateral (ML), — 1.20 mm; dorsoventral (DV), 
—3.00 mm from the bregma (Extended Data Fig. 9). Glass pipettes (Wiretrol, Drum- 
mond) were pulled, back-filled with mineral oil, and front-loaded with the muscimol 
solution (100-150 nl were delivered (20 nl min~') using an oil hydraulic manip- 
ulator system (MMO-220A, Narishige)). For controls, the same volume of fluo- 
rescent muscimol was injected in thalamic structures that are not directly involved 
in somatosensory processing (Extended Data Fig. 9). The craniotomy was then 
covered with Kwik-Cast (WPI) and mice were prepared for intrinsic optical imag- 
ing and whole-cell recordings as described above. To achieve a maximal suppres- 
sion of neuronal activity, patch-clamp recordings were performed at least one hour 
but no longer than 4h after the injection. After completion of the experiment, mice 
were transcardially perfused with 4% paraformaldehyde in PBS (PFA), their brains 
extracted and post-fixed in PFA overnight. Coronal brain sections (100 jum) were 
made to confirm the site and spread of injections (Extended Data Fig. 9). 
Virus injections and cranial windows. For virus injections we used either pups 
between postnatal days 12 and 15 (P12-P15) or adults (>4 weeks). In adults, anaes- 
thesia was induced using isoflurane (4% with ~0.51 min /Q,) and then continued 
using an i.p. injection of a mixture containing medetomidin (Dorbene, 0.2 mg kg’), 
midazolam (Dormicum, 5 mg kg” ') and fentanyl (Duragesic, 0.05 mg kg” ') in 
sterile NaCl 0.9% (MMF-mix). To prevent potential inflammation, salivary excre- 
tions or bradycardia, carprofenum (Rimadyl, 5 mg kg”) and glycopyrrolate (Robinul, 
0.01 mg kg” ') were injected subcutaneously (s.c.) before the surgery. Pups were 
injected under isoflurane anaesthesia. Mice were placed in a stereotaxic frame, the 
skin was disinfected with ethanol 70% and betadine, an incision was made, and 1% 
lidocaine was topically applied to the wound edges for additional local anaesthesia. 
The bregma and lambda were horizontally aligned. A burr hole was made using a 
pneumatic dental drill. Injections were targeted to the caudal part of the POm (coor- 
dinates from bregma: RC, —2.20 mm; ML, — 1.20 mm; DV, —3.00 mm for adults; 
RC, —1.45mm; ML, — 1.60 mm; DV, —2.90 mm for pups)””. AAV2/1-CAG-ChR2- 
Venus (200-500 nl for adults; 50 nl for pups; 1.5 X 10!? GC, UNC Vector Core; 
based on the pACAGW-ChR2-Venus-AAV plasmid, Svoboda laboratory, Janelia 
Farm Research Center)””* or 250-400 nl of AAV2/1-hSynap-GCaMP5G-WPRE- 
S$V40 (2.13 X 10'* GC, Penn Vector Core and the GENIE project)** were injected 
at a maximum rate of 100 nl min‘ using a glass pipette (Wiretrol, Drummond) 
attached to an oil hydraulic manipulator (MMO-220A, Narishige). The solution 
was allowed to diffuse for at least 10 min before the pipette was withdrawn. The 
craniotomy was filled with Kwik-Cast (WPI) and the skin was re-attached with 
stainless steel staples (Precise DS15, 3M) or, in the case of AAV-GCaMP injections, 
a glass window was sealed into the craniotomy as previously described**. For cortical 
GCaMP expression, 20 nl of a mixture of AAV2/9-syn.Flex-GCaMP6 s-WPRE-SV40 
(1.35 X 10'? GC ml !; Penn Vector Core and the GENIE project)?” and AAV2/1- 
hSyn-Cre-WPRE-hGH (1.04x10'? GC ml}; Penn Vector Core) (15,000:1) were 
injected at a maximum rate of 10 nl min’ just before sealing the window. The 
anaesthesia was reversed with an s.c. injection of a mixture containing atipamezole 
(Alzane, 2.5 mg kg~'), flumazenil (Anexate, 0.5 mg kg‘), and buprenorphine 
(Temgesic, 0.1 mg kg~') in sterile NaCl 0.9% (AFB-mix). 
Cannulation and in vivo photostimulation. At least 2 weeks after the injection of 
AAV2-ChR2, mice were anaesthetized with MMF-mix. Intrinsic signal optical 
imaging was performed as above. A 21-gauge cannula (PlasticsOne) with 2.9 mm 
of exposed tip was stereotactically inserted through a burr hole (RC, —2.20 mm; 
ML, —1.20 mm; DV, —3.00 mm from bregma) and secured in place with dental 
cement (Jet Repair Acrylic, Lang Dental Manufacturing). The cannula was closed 
using a screw cap, and anaesthesia was reversed using AFB-mix. Mice were allowed 
to recover for 1 day before being prepared for in vivo patch-clamp recordings. 
For the in vivo photostimulation of ChR2-expressing POm neurons, a stripped 
multimode optical fibre (BFL37-200, Thorlabs) fused to an internal guide (PlasticsOne) 
was inserted into the cannula. The fibre was coupled to a blue DPSS laser (SDL- 
473-050MEL, Shanghai Dream Lasers Technology), which was triggered by a pulse- 
stimulator (Master-8, A.M.P.I). The rise time of a 1-ms laser pulse (300 js) was 
determined with a high-speed Si photodetector (DET10A, Thorlabs) coupled to 
an oscilloscope. The power output of the pulses was ~70% of the steady-state 
power. The steady-state power at the tip of the fibre was measured using a power 
meter (PM100D, $120C, Thorlabs) and adjusted before every recording session to 
~40 mW mm’. No significant reduction in power was observed at the end of the 
experiments. 
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Ca’* imaging and image analysis. A custom-made stainless steel post was cemented 
to the skull with dental acrylic (Jet Repair Acrylic, Lang Dental Manufacturing). Imag- 
ing was performed 14 to 30 days after virus injection using a custom-built 2-photon 
laser scanning microscope (https://openwikijanelia.org/wiki/display/shareddesigns/ 
Shared+Two-photon+ Microscope+ Designs), and using the custom-developed 
acquisition and microscope control software package ScanImage* (https://openwiki. 
janelia.org/wiki/display/ephus/ScanImage)”*. For imaging of awake mice, mice were 
trained and habituated to the microscope setup for 7 days before the experiment. 
During imaging the mice were monitored using an infrared sensitive camera. The 
GCaMPs were excited using a Ti:sapphire laser (Coherent) tuned to 4 = 910 nm. 
For detection we used GaAsP photomultiplier tubes (10770PB-40, Hamamatsu) 
and a X20 (0.8 NA) microscope objective (Olympus). For imaging of axons the 
field of view typically spanned 300 X 300 jim (256 lines, 1ms per line); for dendrites 
43 um X 21 - 43 um (64 lines, 0.5 ms per line). The average excitation power was 
kept below 40 mW, as measured at the focal point of the objective. Bleaching of 
GCaMPs was negligible. Episodes in which piezo-mediated whisker deflections 
were immediately followed by active whisking were excluded from the analysis. 
All image analyses were performed using custom routines in Matlab. We used 
cross-correlation based on rigid body translation to register images over time. Small 
regions of interest (ROIs) were drawn in the dendritic shafts (~1 uum?) or around 
axonal boutons (~7 fim”), based on averaged and standard deviation images. For 
each ROI the baseline fluorescence (Fo) was calculated based on the mean fluores- 
cence intensity within the selected ROI, averaged over 100 consecutive frames before 
whisker stimulation. Change in fluorescence (4F,/Fo) was defined as (F, — Fo)/Fo, 
were Fis the fluorescence intensity at time t (f = time of the first pixel in each frame). 
Boutons were imaged at a depth of 18 um to 42 jum below the pia, dendrites between 
18 um and 70 um. The onset of the response was defined as the time in between the 
whisker stimulus trigger and the time point at which fluorescence intensity reached 
a2 baseline standard deviation threshold (Fy + (2 X s.d.)). For extracting spatial 
and temporal properties of dendritic Ca** events (Extended Data Fig. 5) a Gauss- 
ian function was fitted to the fluorescence intensities of the ROIs in a visually ‘active’ 
region. All Gaussian fits were normalized to their maximum value. The time course 
of the change in fluorescence was extracted from the ROI that represented the peak 
of the Gaussian. A region was considered to be responsive if the fluorescence intensity 
remained above the threshold (Fy + (2 X s.d.)) for at least three imaging frames. 
To evaluate the ChR2-Venus expression profiles and the spread of fluorescent 
mucimol (Extended Data Figs 8 and 9), wide-field epifluorescence images were taken 
of fixed brain slices. Illumination was set such that the full dynamic range of the 
16-bit images was used. A threshold was applied using Fiji’s” implementation of the 
Kapur-Sahoo-Wong (Maximum Entropy) method”. The resulting image masks were 
registered to the corresponding coronal plates (ranging from — 1.94 to —2.70 mm) 
of the Paxinos mouse brain atlas’ using Photoshop (Adobe), at various distances 
posterior to bregma. 
In vitro whole-cell recordings. Coronal slices (thickness = 350 ym) were cut with 
a vibratome (Leica VT $1000) in ice-cold cutting solution containing (in mM): 83 
NaCl, 2.5 KCl, 0.5 CaCls, 3.3 MgSOu, 26.2 NaHCOs, 1 NaH2POu,, 22 D-glucose and 
72 sucrose. Slices were transferred in normal ACSF at ~34 °C for about 30 min and 
stored at room temperature before the experiment. ACSF contained (in mM): 124 
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NaCl, 3 KCl, 2 CaCl, 1.3 MgSO4, 26 NaHCOs, 1.25 NaH2PO,, 10 D-glucose with 
osmolarity of 300 mOsm and pH7.3 when bubbled with 95% O, + 5% COp. 
Individual slices were transferred to the recording chamber and perfused with oxy- 
genated ACSF. All recordings were performed at 37 °C (+ 0.5 °C). Whole-cell record- 
ings were performed using an IR-DIC microscope (Olympus BX51). Recordings 
were performed using borosilicate glass pipettes with resistance of 4-8 MQ and 
filled with an intracellular solution containing (in mM): 110 K-gluconate, 10 KCl, 
10 HEPES, 4 ATP, 0.3 GTP, 10 phosphocreatine and 0.4% biocytin. Recordings were 
amplified using Multiclamp 700 A amplifiers (Molecular devices, USA), filtered at 
4 KHz, digitized (5-20 KHz), and acquired using PulseQ electrophysiology pack- 
age running on Igor Pro (Wavemetrics, USA). Data processing and analysis was done 
using Igor (Wavemetrics) and Excel (Microsoft Office). For optogenetic experiments, 
axons terminals were stimulated with a LED (Thorlabs, Germany) that was focused 
around the recording electrode using a 4x microscope objective. Drugs used include 
4-AP (100 uM; Sigma Aldrich) and tetrodotoxin (TTX) (1 1M; Latoxan). To con- 
firm the identity of recorded neurons, 1 mM Alexa 568 hydrazide (Invitrogen) was 
added to the intracellular solution. 

Statistical analysis. All statistics were performed using Matlab. The « significant 
level was set at 0.05. Normality of all value distributions was assessed by Shapiro- 
Wilk test (x = 0.05). Equality of variance between different distributions was assessed 
by the Levene median test (« = 0.05). Standard parametric tests were only used when 
data passed the normality and equal variance tests (P > 0.05). Non-parametric tests 
were used otherwise. Only two-sided tests were used. Randomization and blinding 
methods were not used. No statistical methods were used to estimate sample size, 
but B-power values were calculated and are provided in Supplementary Information, 
or in the Extended Data Figure legends. 
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Extended Data Figure 1 | An NMDAR block suppresses RWS-induced 
sustained depolarization and prevents LTP. a, Examples of the postsynaptic 
depolarization as induced by RWS in controls (top), following a blockade of 
NMDARs by DAP5 (middle), and hyperpolarization (bottom). Only the first 
2.5 s (20 deflections) of the recordings are shown. Responses were fit with 

an exponential, in which V(t) is the depolarization at time point t in seconds, Vc 
is the depolarization constant (for example, reached after >10 s RWS), Vo and 
ty are the depolarization and time at RWS onset, and 7 is the time constant. 
b, Left, under control conditions (dark blue), the level of LTP is linearly 
correlated to the time constant (t) of the exponential decay (R? = 0.49, 
P<0.05). Following a suppression of NMDAR conductances the time constant 
and the percentage of LTP are independent (DAP5, light blue, R? = 0.03, 

P> 0.05; Hyper, red, R? = 0.03, P> 0.05). Each circle represents a single cell. 
Right, DAP5 significantly reduced the time constant (t) of the exponential 
decay (P < 0.001; Kruskal-Wallis one-way ANOVA on ranks. *P < 0.05, 
post-hoc Dunn’s comparisons versus control condition). ¢, Left, the sustained 
depolarization during RWS is altered when NMDAR conductances are 
suppressed by hyperpolarization (red). Black bar indicates the RWS period. 
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Right, cumulative depolarization at the end of the RWS period (control, 

127 + 21 mV, n = 11; +DAP5, 41 + 28 mV, n = 7; Hyper, 34 + 25 mV, n= 9; 
P= 0.02, one-way ANOVA (f = 0.62) and *P < 0.05, post-hoc Holm-Sidak 
comparisons versus control condition). d, RWS failed to induce LTP when 
NMDARs are blocked by extracellular application of DAP5, when MK801 is 
included in the patch pipette, and when cells are hyperpolarized (control, 
119.8 + 6%, n = 11; DAP5, 98.7 = 2.5, n = 7; iMK801, 98 + 5%, n = 3; Hyper, 
103.6 + 6%, n = 9; P = 0.032, one-way ANOVA and *P < 0.05, post-hoc 
Holm-Sidak comparisons versus control condition). e, A fast Fourier transform 
(FFT) of the responses during RWS (1 min), in controls (left) and after DAP5 
application (middle). The FFT is normalized to the average FFT between 0.1 
and 1 Hz. The presence of a strong 8 Hz component after DAP5 indicates 
that RWS-mediated inputs remain to be activated after the NMDAR block. 
Right, the magnitude of normalized FFT at 8 Hz is similar between control 
+DAP5 conditions (control, 8.8 + 8.5, n = 11; +DAP5, 6.3 + 4.6, n = 7; 

P= 0.733, Mann-Whitney U-test) confirming that part of the whisker 
deflection-evoked synaptic input was unaffected by DAP5, and follows the 
rhythmic stimulation. Values in b-e are represented as the mean + s.e.m. 
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Extended Data Figure 2 | RWS-induced LTP is column- and whisker- 
specific. a, Left, schematic of the experiment: whole cell recordings are targeted 
to the C2 barrel column. Responses are recorded upon deflection of the C2 
(principal whisker, wC2) or C1 (surrounding whisker, wC1) whisker. Inset, 
C2 (blue) and C1 (red) barrel-related columns were mapped using intrinsic 
optical imaging. Right, schematic of PW and SW-associated synaptic pathways 
projecting to L2/3 pyramidal cells. After RWS of wCl (orange), single-whisker 
deflection PSPs were evoked either by the same whisker that was used for 
RWS (wCl1, red) or by the neighbouring whisker (wC2, blue). b, Time course of 
mean wC2- (blue) and wCl-evoked (red) PSP port amplitudes ( + s.e.m.) 
following RWS of wCl (orange bar). c, Mean PSPshort amplitude ( + s.e.m.) 
before and after RWS. Top, wC1-RWS did not significantly enhance 
wC2-evoked mean PSPshort amplitudes (Pre, 7.8 + 1.1 mV; Post, 7.9 + 1.3 mV; 
n= 8; P= 0.960, paired t-test (§ = 0.05)). Bottom, wCl-evoked amplitudes 
were enhanced in some cells, but despite this positive trend the average 
difference was not significant (Pre, 7.4 + 1.3 mV; Post, 8.6 + 1.3 mV; n= 5; 
P= 0.240, paired t-test (B = 0.15)). Grey lines indicate pairs. d, e, Although 
the average PSP amplitude as evoked by wC1 was not significantly different 
from wC2 (wC2, 101 + 2.5%, n = 8; wCl, 118 = 10%, n = 5; P= 0.2, 
Mann-Whitney U-test; d), the number of significantly potentiated cells was 
higher for wC1 (3 out of 5) than for wC2 (0 out of 9; e). 
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Extended Data Figure 3 | Characterization of RWS-induced LTP and 
stability of cell membrane properties. a, For each potentiated cell, a single 
exponential was fit to the normalized PSP.,or. amplitudes immediately 
following RWS, using the following equation: PSP(t) = (1 + APSP, rp) - 
(APSP, rp * e ~t/9)) in which PSP(#) is the normalized PSP amplitude at time t 
in minutes, APSP; 7p is the average change in PSP amplitude during the LTP 
phase, and t the time constant (mean t = 1.09 + 0.26 min; range 0.3-2.34 min; 
n= 7).b, Left, time course of mean PSP,yo, amplitudes following two 
consecutive RWS protocols (RWS1 and RWS2). Right, the mean PSPshort 
amplitude increases upon RWS1 but does not further increase upon RWS2, 
indicating that RWS-evoked LTP is occluded (Pre, 10.32 + 1.6 mV; RWS1, 
12.35 + 1.6 mV; RWS2, 12.98 + 1.5 mV; n = 5; P = 0.006, repeated measures 
ANOVA (f = 0.97); post-hoc Holm-Sidak comparisons). The error bars 
represent s.e.m. c, Example of typical single-cell membrane potential 
fluctuations during anaesthesia before (top, grey) and after (bottom, black) 
RWS. Spontaneous action potentials (APs) are rare and visible only during up 
states. d, Example of spontaneous APs before (grey) and after (black) RWS. APs 
were time-aligned to their respective threshold. e, Effect of RWS on AP 
thresholds, in cells that displayed some spontaneous APs before and after RWS 
(but not during RWS). AP threshold was computed as the minimal membrane 
potential value at the time corresponding to the peak of the third derivative 
of the membrane potential. The Athreshold was calculated as the difference 
between the mean AP threshold after RWS and the mean AP threshold before 
RWS (each circle represents a cell). The level of LTP is independent of 
Athreshold (R? = 0.07, P> 0.05, all cells pooled). f, The mean threshold for 
spontaneous AP is not affected by RWS (Pre, —43.7 + 0.9 mV; Post, 

—44.6 + 1.2 mV; n = 12; P= 0.2, paired t-test (8 = 0.13)). g, RWS does not 


affect the resting membrane potential measured at J = 0 during down states 
(Pre, —71.5 + 0.4 mV; Post, —71.4 + 0.6 mV; n = 12; P= 0.8, paired t-test 
(B = 0.05)). h, The difference between input resistance (ARin) before and after 
RWS. ARin is independent of the level of LTP (R? =10 *, P>0.05; each 
circle represents a cell). Inset, Rin before (grey) and after (dark blue) RWS 
are estimated by measuring the steady-state resistance of a hyperpolarizing 
current pulse. i, The mean Rin is not affected by RWS (Pre, 46.6 + 7 MQ; Post, 
50 + 8 MQ; n = 11; P = 0.13, paired t-test (8 = 0.22)). j, The number of evoked 
APs as a function of injected somatic current injection is not significantly 
modified by RWS (two-way ANOVA, P > 0.5, n = 4). k, The relationship 
between ARin and the relative change in amplitude of PSP hort. Each circle 
represents a cell that was hyperpolarized only during RWS (RWS + Hyper, 
R= 0.6). 1, The mean Rin is not affected by RWS + Hyper (Pre, 37.4 + 4 MQ; 
Post, 37.7 + 5 MQ; n = 9; P = 0.9, paired t-test (6 = 0.05)). m, n, The PSP short 
amplitude evoked by low frequency (0.1 Hz) single whisker deflections, 
before (Pre), during (Hyper) and after (Post) hyperpolarization (—100 mV). 
Hyperpolarization increases the PSP amplitude (m) due to an enhanced driving 
force (Pre, 11.3 + 2.5 mV (average —3 to —1 min); Hyper, 14.7 + 3 mV; 
Post (average +1 to +2 min), 11.3 + 2.4 mV; n = 4; P = 0.002, one-way 
repeated-measures ANOVA (f = 0.98); ***P < 0.001, Holm-Sidak post-hoc 
comparisons Hyper versus Pre and Post conditions. However, the integrated 
PSP is significantly reduced due to the absence of plateau potentials (n) 

(Pre, 0.625 + 0.2 V-ms; Hyper, 0.335 + 0.1 V-ms; Post, 0.626 + 0.2 V-ms; 
n= 4; P= 0.011, one-way repeated-measures ANOVA (B = 0.86); 

**P = 0.007, Holm-Sidak post-hoc comparisons Hyper versus Pre and 

Post conditions). 
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Extended Data Figure 4 | The extraction of whisker-evoked plateau 
potentials. a, Individual PSPs (grey lines) and the average PSP (dark blue line) 
in a single cell in response to single principal whisker deflections (100-ms 
deflections). Individual traces show short and long-latency components. The 
responses to the 30 successive deflections reveal two whisker-evoked PSP 
populations: one that only contains short-latency PSPs and a second 
population that contains both short and long-latency PSPs. b, Similar to 
non-spiking cells, the distribution of whisker-evoked actions potentials (APs) 
also reveals two populations of spikes, based on their onset delay. Inset, for 
each spiking cell, whisker-evoked spikes were sorted as early and late spikes, 
according the delay of the first peak of the subthreshold response. The 
corresponding probabilities were then computed (early, P = 0.04 + 0.02, 

n= 15; late, P= 0.06 + 0.02, n = 15; P = 0.345, z-test). The equal probabilities 
indicate that L2/3 cells spike as often upon a long-latency depolarization as 
in response to a short-latency PSP. ¢, Left, for each trial, the relationship 
between the PSP half-peak amplitude and the average membrane potential 
between 50 and 100 ms after the onset reveals two distinct clusters. Dotted line 
represents the identity line. Right, cluster 1 (top) is defined by an index < 0 and 
consists of PSPs containing only a short latency PSP that quickly returns to 
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the resting membrane potential. Cluster 2 (bottom) is defined by an index > 0 
and consists of compound PSPs with short and long-latency components. 
The long-latency component of the PSP has a strong plateau-like appearance. 
Therefore, we defined this index as the Indexpjateau- d, Example of the 
distribution of clusters 1 and 2 over time. The Indexpiateay Was computed from 
the example shown in a. The probability of eliciting plateau potentials in a 
neuron is calculated by dividing the number of PSPs in cluster 2 (N2) by 

the total number of PSPs (N1 + N2). e, f, Blocking NMDARs (light blue) 
significantly reduced the probability of eliciting plateau potentials as compared 
to controls (dark blue) (e, ks test, P< 0.001), but does not affect the amplitude 
of the short-latency PSP (PSPshort) (f ks test, P> 0.05). g, Blocking NUDARs 
by epidural application of DAP5 or intracellular MK801 does not affect the 
amplitude of PSP yo (control, 10.5 + 0.8 mV, n = 33; DAPS5, 9.5 + 1.5 mV, 
n= 12; iMK801, 10.3 + 1.3 mV, n = 10; P> 0.05, one-way ANOVA 

(B = 0.05)). h-j, Changes in the length of the whisker deflection period 
(stimulus length) do not affect the whisker-evoked mean PSP, jor amplitude 
(h), the mean PSP integral (i), or the probability to elicit plateau potentials 
(j) (n = 4, P> 0.05, one-way ANOVA (8 = 0.05)). Values in b and g-j are 
represented as the mean + s.e.m. 
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Extended Data Figure 5 | NMDAR-dependent whisker-evoked Ca?* events 
in L2/3 pyramidal neuron dendritic tufts. a, Examples of single whisker 
deflection-evoked Ca”* responses in dendritic spines (arrowheads) and 
dendritic shafts (regions in between arrows). b, Top left, GCaMP6 s 
fluorescence standard deviation image with ROIs. Bottom left, raster plot of 
AF/Fo for each ROI in the top panel (aligned). Red bar represents the whisker 
stimulation onset. Right, AF/Fo traces of some ROIs from a. c, Left, time course 
of mean dendritic Ca** response in individual dendritic branches upon a 
single whisker deflection (2-5 trials per branch, n = 48 dendritic branches, 
n= 9 mice). Grey, individual branches. Black, average response. Right, 
distribution of response onset times. d, Averaged (blue thick line) and 
individual (grey lines) Gaussian fits of local responsive regions in dendritic 
shafts. e, Distribution of the FWHM of the Guassian fits in d. f, Epidural 
application of DAP5 (10 |1M) significantly reduces whisker-evoked local 
dendritic Ca?* response probabilities. The subsequent addition of 1 mM DAP5 
in some cases further reduced probabilities and in others did not show an 
additive effect (control, 0.29 + 0.06, DAP5 (10 LM), 0.14 + 0.04; n = 16 
branches, n = 3 mice; P = 0.002, Wilcoxon signed-rank test; DAP5 (10 1M), 
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0.08 + 0.03; DAP5 (1 mM), 0.04 + 0.02; n = 9 branches, N = 2 mice; P = 0.37, 
Wilcoxon signed-rank test). g, On average, DAP5 significantly reduces 
whisker-evoked local dendritic Ca** response probabilities (control, 

0.28 + 0.03; DAP5 (10 1M), 0.12 + 0.03; DAP5 (1 mM), 0.04 + 0.02; Kruskal- 
Wallis one-way ANOVA on ranks; post-hoc Dunn’s comparisons versus 
control condition, P< 0.05)). h, Left, GCaMP6 s fluorescence standard 
deviation image with ROIs. Right, AF/Fo traces of some ROIs from the left 
panel. Grey box represents the RWS period. i, Integrated AF/F, in dendritic 
branches during RWS (0-15 s) as a function of the response before RWS (0-15 
baseline). Each circle represents a single dendritic branch. Red, global events 
(responses spanning the whole field of view, minimally 43 1m); grey, local 
events (responses spanning a portion of the field of view, maximally 43 um). 
Black line indicates the identity line. RWS significantly increases AF/Fo for a 
substantial number of branches. j, The average integrated AF/Fy in dendritic 
branches during RWS (0-15 s) is significantly reduced upon topical application 
of DAPS (control, 168 + 15%; DAPS (10 1M), 119.6 + 14.5%; DAP5 (1mM), 
100.1 + 18.5%; paired t-test). 
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Extended Data Figure 6 | Whisker-evoked activity in cortical POm 
efferents in anaesthetized and awake mice. a, c, Raster plots of average Ca?t 
responses (AF/Fo) in 400 different cortical POm efferent boutons over 10-20 
successive (0.1 Hz) bursts of 5 whisker deflections (20 Hz, black bar) in 
anesthetized mice (a) and in awake mice (c). b, d, Average whisker-evoked 
responses over 400 boutons under anaesthesia (b) and under wakefulness 
(d). e, Raster plot for 25 of the most active boutons (out of 765) upon RWS. 
Black bars indicate the period of RWS (1 min). The Ca** response in some 
boutons remains elevated over the whole RWS period. f, Integrated Ca?* 
responses of individual boutons during RWS (0-15 s) as a function of their 
responses before RWS (15 s baseline). Each circle represents a single bouton. 
The 25 most and least responsive boutons are in blue and red respectively. Black 
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line indicates the mean relationship (linear fit). Dotted lines indicate the 
relationship at various standard deviations from the fit. A substantial 
proportion of boutons (16%; 124 out of 765) display RWS/baseline ratios larger 
than 1 s.d. from the mean. g, Time-lapse image of fluorescence change 
representing Ca** responses in axonal boutons (dotted circles) upon a single 
whisker deflection (red bar; 45 ms). Response onsets are indicated by arrows. 
Scale bar represents 1 um. h, Example of the response curve of the boutons 
in g. Response onset latency was defined as the time frame in which AF/Fp 
exceeded 2 X s.d. of the baseline. i, j, Distributions of response onset latencies 
under anaesthesia (black, 120 trials, n = 5 boutons, n = 3 mice) and under 
wakefulness (blue, 11 trials, n = 5 boutons, n = 3 mice). 
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Extended Data Figure 7 | Plateau potentials in L2/3 pyramidal neurons 
and POm-efferent activity are not whisker-specific. a, Schematic of the 
experiment. Whole-cell recordings are targeted to the C2 barrel column. 
Responses are recorded upon deflection of the C2 whisker (wC2, principal 
whisker, PW, blue) or the C1 whisker (wC1, surrounding whisker, SW, red). 
b, Right, example of 2PLSM images of POm boutons expressing GCaMP5. Both 
the PW (wC2) and SW (wCl1) evoke a Ca?t response. ¢, Left, the average Ca2t 
transient (AF/Fo) for both whisker deflections (shadows represent the s.d.). 
Right, mean AF/Fy upon deflection of the PW (wC2) and SW (wC1) (wC2, 
1.37 + 0.07, n = 5; wC2, 1.38 + 0.08, n = 5; P> 0.05). This confirms that POm 
activity is not selective for whiskers. Values are represented as mean + s.e.m. 
d, Top, example of the average PSP evoked by the PW (blue) or the SW (red). 
To estimate the integral of the plateau potential (bottom), the decay of the 
first component is fitted with a single exponential and subtracted from the 


average of PSPs containing both short and late-latency components (cluster 2 
in Extended Data Fig. 4c). e, For each parameter, whisker selectivity is 
defined by the ratio between the PW and SW: (PW — SW)/(PW + SW). 

All parameters related to plateau potentials (plateau strength, plateau integral, 
probability) are not specific to either one of the whiskers. In contrast, and 

as expected, the short-latency PSP amplitude (PSPshort) is higher (and thus 
more selective) for the PW (PSP nor 0.25 + 0.09, n = 27; plateau strength, 
0.006 + 0.05, n = 26; plateau integral, 0.04 + 0.02, n = 26; plateau probability, 
0.07 + 0.04; n = 27; P<0.001, one-way ANOVA on ranks; *P< 0.05, 
post-hoc Dunn’s comparisons versus PSPshort condition). As the amplitude of 
short-latency PSPs is whisker-selective and plateau potentials are not, it is 
conceivable that POm associated synaptic pathways are responsible for 
mediating whisker-evoked plateau potentials. 
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Extended Data Figure 8 | AAV-mediated expression of ChR2-Venus in the 
POm nuclei of the thalamus and their efferents in L1. a, Representative 
example of the ChR2-Venus expression profile in the mouse thalamus. The 
expression profile of ChR2-Venus is depicted in blue, POm nuclei in light 
green, and the VPm in light red. b, Left, example of a cortical slice with 
ChR2-Venus fluorescence in the caudal sector of the POm (bregma —2.3). 
Right, an image of the fluorescence profile in the somatosensory cortex in the 
same animal as on the left. Fluorescence intensities (F,,,,.) were measured as a 
function of cortical depth (in pixels (px)) by summing all pixels within the 
dotted rectangle over the short axis (pixel size = 1.85 jim). The fluorescence 
intensity profile is similar to the cortical projection pattern of efferents from the 
caudal sector of the POm (Supplementary Note 4). c, Plot comparing the 
intensity profiles of 10 different animals in which injections were aimed at 
the POm nuclei in the thalamus. Animals classified as bearing expression 
profiles in the barrel cortex that are typical of POm projections (Supplementary 
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Notes 4 and 5) displayed distinct fluorescence peaks in L1 (0.04 - 0.12 
normalized depth) and L5 (0.48 - 0.7 normalized depth; n = 7, black lines 
and blue line). In contrast, animals with a (additional) distinct peak in L4 
(0.36 — 0.48 normalized depth; n = 3, red lines) were considered as having at 
least some spurious expression in VPm, and were thus excluded from the 
analysis in Fig. 3. The classification matched the expression profiles in the 
thalamus (Supplementary Information). d, e, Assessment of POm mediated 
synaptic inputs onto L2/3 neurons in acute cortical slice preparations. 

e, Schematic of the slice experiment. L2/3 pyramidal neurons were recorded 
during local photostimulation (through the objective) of ChR2-expressing 
POm axons. f, Left, example of photostimulation-evoked PSPs in a single L2/3 
cell under control conditions and following bath application of TTX and 4AP. 
Right, average PSP amplitudes in controls and after TTX + 4AP application 
(control, 1.8 + 1.1; TTX + 4AP, 0.31 + 0.08, n = 4, P = 0.125, paired Wilcoxon 
signed-rank test). 
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Extended Data Figure 9 | The spread of fluorescent muscimol in thalamic 
nuclei. a, Coronal diagrams of the mouse brain adapted from the Paxinos 
atlas*' including the POm (light green) and VPm (light red) nuclei at 

various posterior distances from bregma. Each red or green line represents 
the maximal spread of fluorescent muscimol, as assessed using whole field 
epifluorescence microscopy (Olympus; X20 and X60 objective) and 
Neurolucida (Microbrightfield) reconstructions. Green lines (8 mice) represent 
injections that were confined to the POm. Red lines (3 mice) represent 
injections that infiltrated both POm and VPm thalamic nuclei. b, c, Examples of 
a muscimol injection in the caudal part of the POm (b) and an injection that 
spread into both POm and VPm nuclei (c). d, Examples of whisker-evoked 
PSPs in L2/3 neurons of mice in which muscimol was present in both POm 
and VPm. Successes (top) and failures (bottom) are shown. Grey lines, 
individual trials, light red lines, average. e, Blocking activity in VPm decreases 
the whisker-evoked PSP success rate (top; control, 1 + 0, n = 33; POm block, 
1+0,n=9; POm + VPm block, 0.56 + 0.1, n = 7; P< 0.001, one-way 
ANOVA on ranks; *P < 0.05, post-hoc Dunn’s comparisons versus control 
condition), as well as PSP.hort amplitudes (bottom; control, 10.5 + 0.8, n = 33; 
POm block, 7.5 + 1.3, 1 = 9; POm + VPm block, 3.15 + 0.6, n = 7; P< 0.001, 
one-way ANOVA; *P < 0.05, post-hoc Holm-Sidak’s comparisons versus 
control condition). f, Blocking POm + VPm or POm only significantly 
decreases the probability of plateau potentials (top; control, 0.35 + 0.04, n = 33; 
POm block, 0.16 + 0.04, n = 9; VPm block: 0.22 + 0.05, n = 7; P< 0.001, 
one-way ANOVA on ranks; *P < 0.05, post-hoc Dunn’s comparisons versus 
control condition), and the normalized plateau strength (bottom; control, 
10.15, n = 33; POm block, 0.19 + 0.05, n = 9; POm + VPm block, 
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0.22 + 0.07, n = 7; P< 0.001, one-way ANOVA on ranks; *P < 0.05, post-hoc 
Dunn’s comparisons versus control condition). g, examples of single-cell 
spontaneous membrane potential fluctuations during anaesthesia in controls 
(top) and upon muscimol injections in POm (middle) or POm + VPm 
(bottom). h, Top left, FFT of membrane potentials in controls (dark blue) and 
after muscimol injection in POm (green) or POm + VPm (light red). Top right, 
Blocking POm + VPm significantly decreases the 1-5-Hz range in the FFT 
(control, 7.33 + 0.76, n = 14; POm block, 8.03 + 0.81, n = 8; VPm block, 

4.2 + 0.37, n = 7; P = 0.008, one-way ANOVA; *P < 0.05, post-hoc Holm- 
Sidak’s comparisons versus control condition). Bottom, blocking POm + VPm 
significantly decreases the probability of spontaneous up states (left (Pup); 
control, 0.16 + 0.016, n = 14; POm block, 0.15 + 0.008, n = 8; POm + VPm 
block, 0.09 + 0.016, n = 7; P = 0.023, one-way ANOVA; *P < 0.05, post-hoc 
Holm-Sidak’s comparisons versus control condition), as well as the amplitude 
of spontaneous up states (right (AMP,p); control, 22.3 + 1.9, n = 14; POm 
block, 21.5 + 1.6, n = 8; POm + VPm block, 14.1 + 1.7, n = 7; P = 0.02, 
one-way ANOVA; *P < 0.05, post-hoc Holm-Sidak’s comparisons versus 
control condition). i, Animals in which muscimol injections in the medial 
posterior thalamus did not infiltrate the POm (out of POm) were used as a 
negative controls ( = 6). In these animals, the probability of eliciting plateau 
potentials (P,tateau) remained equal to controls. The probability was 
significantly reduced, only when muscimol was correctly targeted to POm 
(control, 0.6 + 0.04, n = 44; +muscimol In POm, 0.16 + 0.04, n = 9; 
+muscimol Out POm, 0.72 + 0.06, n = 8; P< 0.001, one-way ANOVA on 
ranks; *P < 0.05, post-hoc Dunn’s comparisons versus control condition). 
The values represent the mean + s.e.m. 
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Luminal signalling links cell communication to tissue 
architecture during organogenesis 


Sevi Durdu!, Murat Iskar!, Celine Revenu't, Nicole Schieber', Andreas Kunze!, Peer Bork!, Yannick Schwab! & Darren Gilmour! 


Morphogenesis is the process whereby cell collectives are shaped 
into differentiated tissues and organs’. The self-organizing nature of 
morphogenesis has been recently demonstrated by studies showing 
that stem cells in three-dimensional culture can generate complex 
organoids, such as mini-guts’, optic-cups’ and even mini-brains*. To 
achieve this, cell collectives must regulate the activity of secreted sig- 
nalling molecules that control cell differentiation, presumably through 
the self-assembly of microenvironments or niches. However, mech- 
anisms that allow changes in tissue architecture to feedback directly 
on the activity of extracellular signals have not been described. Here 
we investigate how the process of tissue assembly controls signalling 
activity during organogenesis in vivo, using the migrating zebrafish 
lateral line primordium’. We show that fibroblast growth factor (FGF) 
activity within the tissue controls the frequency at which it deposits 
rosette-like mechanosensory organs. Live imaging reveals that FGF 
becomes specifically concentrated in microluminal structures that 
assemble at the centre of these organs and spatially constrain its sig- 
nalling activity. Genetic inhibition of microlumen assembly and laser 
micropuncture experiments demonstrate that microlumina increase 
signalling responses in participating cells, thus allowing FGF to co- 
ordinate the migratory behaviour of cell groups at the tissue rear. 
As the formation ofa central lumen is a self-organizing property of 
many cell types, such as epithelia® and embryonic stem cells’, lumi- 
nal signalling provides a potentially general mechanism to locally 
restrict, coordinate and enhance cell communication within tissues. 

A major challenge in biology is to explain how the pattern of complex 
organs emerges through dynamic self-organizing processes occurring 
at cellular and molecular scales'*”. The development of the zebrafish 
posterior lateral line system provides an example of an in vivo organo- 
genesis process that has the potential to be understood quantitatively 
at subcellular resolution’®. Here, a series of rosette-like mechanosensory 
organs is assembled and deposited along the flanks of the embryo by a 
collectively migrating epithelial primordium*. While a number of sig- 
nalling pathways required for this process have been identified”, it is 
currently not known how their activity is coupled to this organogen- 
esis process. We therefore first performed a quantitative analysis of the 
normal organ deposition process by time-lapse imaging of many wild- 
type (WT) embryos (Fig. 1a and Supplementary Videos 1 and 2). This 
revealed that the overall pattern of organ spacing is determined by the 
timing of deposition events, rather than by sustained changes in the speed 
of primordium migration or growth of the embryo (Fig. 1b and Extended 
Data Fig. 1). 

The best candidate regulator of this organ deposition process is FGF 
signalling, as FGF ligands have been shown to be required for organ 
formation’**. To test if this pathway controls organ deposition timing 
we reduced its activity in a stepwise manner, by titrating the FGF re- 
ceptor inhibitor SU5402 (ref. 12). This showed that reducing FGF activity 
results in a dose-dependent delay in organ deposition (Fig. 1c, Extended 
Data Fig. 2 and Supplementary Video 3), a finding we confirmed using 
mutants for Fefrla’’, the receptor that mediates signalling in this con- 
text (Extended Data Fig. 2 and Supplementary Video 4). Conversely, 
when we increased the concentration of FGF-ligand, by expressing a 


progesterone-inducible transcription factor’® (cxcr4b:lexPR) that drives 
uniform overexpression ofa functional fusion protein of Fgf3 and green 
fluorescent protein (/exOP:fgf3-GFP), organ deposition was accelerated 
in a dose-dependent manner (Fig. 1d, Extended Data Fig. 2 and Sup- 
plementary Video 5). Uniform overexpression of Fgf3—GFP did not sig- 
nificantly alter rosette-like organ assembly rate, indicating that its effect 
was primarily on the migratory behaviour of assembled organs (Extended. 
Data Fig. 3). Thus, the timing of this organ deposition process can be 
controlled over a wide dynamic range by the activity level of a single 
signalling molecule. 

Since FGF regulates lateral line organ deposition in a dose-dependent 
manner, its extracellular concentration and distribution must be tightly 
controlled. Imaging the FGF distribution after uniform overexpression 
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Figure 1 | FGF signalling regulates organ deposition timing in a dose- 
dependent manner. a, Quantitative analysis of lateral line patterning. Position 
of organs along lateral line at 2 days post-fertilization (d.p.f.) (cldnb:lynGFP). 
Plot shows intensity profile of pooled organ positions (below, N = 60 
(throughout, N represents number of embryos and n represents data points)). 
Organs and migrating primordium are colour-coded. b, Correlation of organ 
deposition timing and spacing between consecutive depositions (Spearman 

r =0.77,n= 260). ¢, d, Influence of FGF level on organ deposition. 

c, Kymographs of control, 0.5 1M and 1 1M SU5402-treated samples. 

Plot shows quantification of organ deposition timing (n = 82, 114, 104). 

d, Kymographs of control, 5, 10 and 20 14M RU486-treated samples and plots 
of organ deposition timing (n = 64, 57, 65, 58). Scale bars, 500 um (a), 200 um, 
5h (c, d). Statistics: Wilcoxon, ***P < 0.001. 
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Figure 2 | Secreted FGF becomes concentrated in multicellular 
microlumina at the centre of organ progenitors. Fgf3-GFP and secGFP 
images are taken from stable LexOP lines unless otherwise stated. a, Fgf3-GFP 
localizes to apical spheres (arrowheads) after uniform expression (red 
membrane counter-label, cells outlined in dashed lines in side view). 

b, Kymographs showing that the appearance of Fgf3-GFP and secGFP 
accumulation correlates with onset of organ deposition. c, secGFP localization 
in primordium. d, Intracellular accumulation of Fgf3-GFP vesicles after 
brefeldin A (+BfA) treatment. e, Schema comparing expression (green) and 
localization (yellow) patterns of Fgf3-GFP. f, Fgfrla protein distribution 
(anti-Fgfrla antibody) in migrating primordium. g, Anti-Fgfrla antibody 
staining in apical and basal optical slices of a rear rosette (calibration bar: signal 
intensity). h, i, Tight junction ‘bucky ball’ in deposited organs (yellow) and tight 


of Fgf3-GFP revealed that it was concentrated in spherical volumes at 
the apical centre of organ progenitor rosettes (Fig. 2a, e). Time-lapse anal- 
ysis showed that the appearance of these spheres correlated with the 
deceleration and arrest of the associated organ progenitor (Fig. 2b and 
Supplementary Video 6). Apical spheres were also observed when the 
tissue expressed a secreted form of GFP (secGFP; Fig. 2b, c). Inhibiting 
protein secretion with brefeldin A prevented Fgf3-GFP localization to 
these apical spheres and retained it in vesicles within all cells of the pri- 
mordium (Fig. 2d), indicating that these spheres represent tightly re- 
stricted pools of apically secreted proteins (Extended Data Fig. 4). By 
contrast, direct visualization of endogenous F¢frla, using a newly gen- 
erated monoclonal antibody against the zebrafish protein, revealed an 
unrestricted plasma membrane distribution of the receptor (Fig. 2f, g). 
Immunofluorescence of tight-junctions”’ (Fig. 2h-j) and ultrastructural 
analysis using correlative light electron microscopy (CLEM; Fig. 21-n 
and Extended Data Fig. 5)'* demonstrated that these apical spheres of 
secreted protein represent extracellular pockets, or microlumina, assem- 
bled from cell apical domains and displaying the cell junctions char- 
acteristic of a lumen (Fig. 2n, o and Extended Data Fig. 5). Identical 
luminal localization was also observed when Fgf3-GFP was expressed. 
at normal physiological levels using BAC-mediated complementation 
(fef3:fgf3-GEP; Fig. 2k and Extended Data Fig. 6). Interestingly, these 
microlumina showed a geodesic organization to which each cell of the 
organ progenitor contributes a facet and thus has access to this shared 
microenvironment (Fig. 2m, 0). The secGFP signal correlated perfectly 
with the shape of the luminal cavity, even filling ‘side-pockets’ that are 
formed stochastically by protruding sensory kinocilia of differentiating 
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junctions of overlying skin cells (blue). i, Higher-resolution view of h. j, Co- 
labelling of Fgf3-GFP (green) and tight junctions (ZO1, red). k, BAC fgf3:Fef3- 
GFP embryo shows microluminal localization, membranes labelled with 
BAC cxcr4b:Cxcr4b-RFP (REP, red fluorescent protein). -n, CLEM analysis of 
microlumen. 1, Fluorescent images used for CLEM alignment showing slice 
position (dashed), orthogonal view (below), secGFP and electron microscope 
overlay at same position (right). m, Plasma membrane tracking shows 
microlumina is assembled form many cells (pseudocoloured). n, Microluminal 
space and surrounding cell junctions (schematic: section position), boxes show 
higher-resolution view (red, tight junctions (TJ); orange, adherens junctions 
(AJ)). 0, Schematic representation of organ with central microlumen. Scale 
bars, 50 um (a, c, d), 200 um, 5h (b), 20 um (f), 5 pum (g, h, i, j, k, 1, m, n). 


organs (Fig. 2], Extended Data Fig. 5 and Supplementary Video 7), sug- 
gesting that secreted proteins freely diffuse within the microlumen. 
Indeed, fluorescence loss in photobleaching (FLIP) and fluorescence 
recovery after photobleach (FRAP) analysis of Fgf3-GFP confirmed 
that Fgf3-GFP is highly mobile within the microlumen (Extended Data 
Fig. 5). 

“The results described above reveal that microlumina could act as ‘hubs’ 
that locally concentrate secreted FGF molecules and ensure coordinated 
signalling responses within the migrating tissue. Alternatively, signal- 
ling activity may be determined by concentration gradients of freely 
diffusible FGF molecules in the open extracellular environment, con- 
sistent with its known role as a morphogen in other contexts’’. To dis- 
tinguish between these two models (Fig. 3a), we investigated the range 
of FGF action by overexpressing the protein from randomly positioned 
cell clones that were generated either by cell transplantation (Fig. 3b 
and Extended Data Fig. 7) or mosaic expression of lexOP:fgf3-GFP 
(Fig. 3d). Interestingly, Fgf3-GFP was secreted into microlumina inde- 
pendently of the position of the expressing cell within the rosette, indi- 
cating that any cell of the group can contribute signal to the microluminal 
pool (Fig. 3c and Extended Data Fig. 7). Asa first readout of FGF activity, 
we mapped the deposition intervals of organs with and without ectopic 
Fgf3—-GFP-expressing cells. As shown in Fig. 3, individual ectopic Fgf3- 
GFP-expressing cells efficiently arrested the migration of cells that were 
connected to the same Fgf3-GFP-positive microlumen but they had no 
effect on cells in neighbouring organs, even when they were physically 
closer than cells of the same organ (Fig. 3b, d, Extended Data Fig. 7 and 
Supplementary Video 8). Thus, the organ deposition response to ectopic 
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Figure 3 | Microlumina focus FGF-signalling activity within migrating 
collective. a, Schema showing possible outcomes of ectopic FGF experiments. 
b, lexOP:Fgf3-GFP/cxcr4b:nls-tdTomato cell clones (green) transplanted into 
cldnb:lynGFP primordium (red), showing future organ territories (dashed line) 
and final pattern of organ deposition (right). c, Time-lapse of microluminal 
filling by single Fgf3—-GFP cell after drug induction in mosaic tissue (arrow 
heads pointing organ centre). d, Organ deposition pattern after mosaic 
lexOP:fgf3-GFP (green) expression by transient injection. Box-plot of organ 
spacing in mosaic embryos with Fgf3-GFP-positive microlumen (green, 

n = 240), without Fgf3-GFP-positive microlumen (red, n = 170) and control 


FGF activity was local and coordinated, affecting only cells that shared 
a microlumen with Fgf3—GFP source cells. To measure FGF signalling 
more directly, we next monitored the transcription of its immediate 
target gene pea3 (ref. 20) by single-molecule fluorescent in situ hybrid- 
ization approach (smFISH), which revealed a clear Fgf-dependent re- 
sponse (Fig. 3e, f, Extended Data Fig. 8 and Supplementary Video 9). Pea3 
smFISH analysis of primordia carrying ectopic Fgf3-GFP-expressing 
clones showed that all cells in contact with Fgf3—GFP-positive microlu- 
mina upregulated target gene transcription, whereas cells from neighbour- 
ing rosettes showed no change (Fig. 3g), confirming that the response 
to FGF signalling is highly restricted. In addition, this revealed that up- 
regulated FGF signalling responses were uniform across individual organ 
precursors. In conclusion, single-cell ectopic expression of FGF does 
not support a model where FGF responses are determined by a con- 
centration gradient diffusing from secreting cells. By contrast, these 
data are fully consistent with the model where the FGF signal is locally 
concentrated, and collectively presented, by a central microlumen. Thus, 
the formation of microlumen allows FGF signalling, widely studied for 
its role in mediating long-range positional information”, to coordinate 
locally the behaviour of discrete cell groups within the migrating tissue. 

The luminal signalling model predicts that these multicellular assem- 
blies are required for efficient FGF signalling responses. To test this, we 
first applied a genetic method to prevent microluminal assembly, an 
approach complicated by the fact that many key regulators of this pro- 
cess also control epithelial polarity”’’”. We therefore knocked down 
shroom3, an actin-binding” protein required for apical constriction of 
organ progenitor cells but not for epithelial polarity**. Morpholino knock- 
down of shroom3 transiently suppressed microluminal formation, as 
revealed by a failure to concentrate secGFP (Fig. 4a). smFISH analysis 
of pea3 confirmed that FGF target gene transcription was significantly 
reduced (Fig. 4b). Second, we acutely opened microlumen structures by 
two-photon laser micropuncture, which caused rapid leakage of Fgf3- 
GFP (Fig. 4c, d, Extended Data Fig. 9 and Supplementary Video 10), 
confirming that these local build-ups of FGF signal are dependent on mi- 
crolumen integrity. The microluminal opening by laser micropuncture 
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injections (blue, n = 141), normalized by mean interval values of controls. 

e, Magnified views of pea3 smFISH in WT, FGF inducer- (Ind.; 15 4M RU486) 
and FGF inhibitor- (Inh.; 4 4M SU5402) treated primordia, as segmented 
images. Quantification of pea3 transcripts per cell (tissue N = 4, 4, 4, 
approximately 160 cells per tissue). f, Correlation of luminal Fgf3-GFP 
intensity and pea3 transcript counts achieved by titration of inducer (Spearman 
r= 0.85, n = 18); a.u., arbitrary units. g, pea3 smFISH in Fgf3-GFP-positive 
and -negative organs. Only Fgf3-GFP-positive organs show higher pea3 
transcript levels per cell. Scale bars, 50 tm (b left), 200 tum (b right, d), 10 pm 
(c), 2 um (e). Statistics: Wilcoxon. NS, not significant; *P < 0.05; ***P < 0.001. 


was transient, as revealed by the recovery of characteristic microlum- 
inal Fgf3-GFP spheres, showing that this targeted perturbation had 
negligible effects on cell viability (Extended Data Fig. 9). Nevertheless, 
smFISH analysis of pea3 revealed that target gene expression was re- 
duced after transient depletion of microluminal Fgf3-GFP, demonstrat- 
ing that trapping of secreted FGF is required to maintain high signalling 
levels (Fig. 4e). In addition, microluminal opening specifically delayed 
the deposition of targeted organs by prolonging their migration, a direct 
confirmation that microlumina are required for FGF to exert its bio- 
logical role during this organogenesis process (Fig. 4f). Finally, we ex- 
ploited the fact that the microluminal cavity is opened when deposited 
organ progenitors fuse with the overlying skin, a natural event that also 
leads to rapid loss of Fgf3-GFP (Extended Data Fig. 10). Since the timing 
of skin fusion varies between embryos, we could directly compare organs 
where microlumina had just opened with those that were still closed at 
identical developmental stages (Fig. 4g, Supplementary Video 6). smFISH 
analysis of pea3 in this unperturbed context revealed that FGF signal- 
ling was again reduced specifically in organs with opened microlumina 
(Fig. 4g). Combined, these data provide compelling experimental sup- 
port for a model where microlumina act as shared microenvironments 
that locally concentrate FGF to enhance signalling within the migrating 
tissue (Fig. 4h). 

Previous studies addressing the regulation of extracellular signals have 
focused on the role of additional cell surface or extracellular proteins, 
such as heparan sulphate proteoglycans” and receptors*’, whose own 
spatiotemporal regulation is currently under investigation’’. Here, we 
uncover an alternative mechanism that instead exploits an intrinsic bio- 
logical feature of epithelial tissues, namely their ability to assemble a 
shared enclosed lumen’. This finding has important implications for 
understanding how responses to extracellular signals are controlled and 
coordinated in tissues in vivo. To our knowledge, it provides the first 
such mechanism that acts specifically at the level of multicellular organ- 
ization, as only cell groups that assemble a central lumen are able to trap 
and concentrate the freely diffusible ligand (Fig. 4h). We propose that 
formation of the microlumen is required to restrict, coordinate and 
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Figure 4 | Microluminal assembly and integrity 
are required for efficient FGF signalling. 

a, b, Knockdown of shroom3 (Shroom3 MO) 
prevents microlumen formation and leads to lack 
of apical spheres in lexOP:secGFP primordia 

(a, arrowhead), and reduces pea3 transcription, 
shown in profile plot and box-plot quantification 
(b, N = 6, 6). ce, Schema of two-photon 
micropuncture approach. c, Schematic 
representation of micropuncture experiment (left). 
Plot of Fgf3-GFP pool fluorescence intensity after 
micropuncture (green) or internal bleach pulse 
(red). Panels show kymographs. d, Images of 
Fgf3-GFP-positive organ before and after 
micropuncture. e, Pea3 smFISH of Fgf3-GFP- 
expressing organs 30-60 min after micropuncture, 
comparing organ 2 micropunctured samples (pink, 


N= 16) with controls (blue, N = 6). Absolute pea3 
transcripts per cell were normalized to first organ 


for each embryo. f, Organ deposition delay after 
luminal micropuncture of Fgf3-GFP-expressing 
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second (n = 16, 23) and third (n = 7, 7) organs. 
Quantification of organ deposition spacing 
(bottom). g, Comparison of secGFP- (green) 
expressing primordia (red) in identical stage 


embryos; lower specimen shows loss of secGFP 
microlumina in organ 1 after fusion with 

the overlying skin (arrowhead). smFISH reveals 
reduced pea3 transcript levels in opened organs 
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enhance FGF signalling within the migrating tissue. This local increase 
in FGF activity then positively feedbacks on microlumina by increasing 
the epithelial character of responding cells, leading to the formation of 
stable rosettes. Thus, luminal hubs provide a morphogenetic checkpoint 
function by ensuring, in this case, that cells become polarized and or- 
ganized before they can respond efficiently to signals promoting their 
differentiation. Moreover, as lumen formation itself is highly sensitive 
to changes in epithelial polarity and adhesion’, it is likely that luminal 
signalling hubs can be rapidly disassembled and reassembled by pro- 
cesses that alter cell cohesion, such as the epithelial-mesenchymal tran- 
sition that is a hallmark of organogenesis and cancer*’. However, this 
mechanism could potentially be active in any context where cells con- 
struct a lumen or similar enclosed extracellular microenvironment, such 
as the transient tissue-folds that are prevalent during morphogenesis”. 
A notable example is provided by the recent finding that early mam- 
malian embryos and embryonic stem cells self-organize to form polar- 
ized rosettes with a central lumen’, structures that are morphologically 
highly similar to those interrogated here. Our study suggests potential 
signalling roles for shared lumina in many other tissue contexts. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Fish handling. Zebrafish (Danio rerio) strains were maintained following stand- 
ard protocols*’. Embryos were raised in E3 buffer at 26-30 °C. All zebrafish experi- 
ments were conducted on embryos younger than 3 d.p.f., under the rules of the 
European Molecular Biology Laboratory and the guidelines of the European Com- 
mission, Directive 2010/63/EU. In all experiments involving chemical treatment, 
embryos were dechorionated with pronase. Before live imaging and fixation proce- 
dures, embryos were anaesthetized with 0.01% tricaine. For in situ hybridization 
and immunostaining experiments, embryos were treated with 0.002% phenylthio- 
urea at 24 hours post-fertilization (h.p.f.) to prevent pigmentation. For live imaging, 
embryos were mounted in 0.8% low melting agarose in glass-bottom dishes (MatTek 
or CELLview) and imaged at 28 °C unless otherwise stated. The following mutant 
and transgenic strains were used: fefrla°®”°"" (ref. 15), cxcr4b:nls-tdTomato*, 
cldnb:lynGFP**. 

Inducible gene expression system and BAC lines. The LexPR/LexOP transacti- 
vation system'® was used to express genes in the lateral line upon addition of the 
progesterone analogue RU486. Two LexPR ‘driver’ lines were generated by insert- 
ing the cassettes (1) LexPR/polyA/LexOP:lynRFP/SV40polyA/FRT-KanR-FRT or 
(2) LexPR/SV40poly(A)I/FRT-KanR-FRT into the first exon of the Cxcr4b BAC 
clone CH211-145M5 by ET recombineering (Gene Bridges). The KanR cassette was 
subsequently removed with FLP recombinase. To serve as a transgenic marker, the 
“crystal eye’ cry:CFP/KanR cassette was inserted in the BAC backbone as a trans- 
genic marker. Modified BACs were purified (Large Construct Kit, Qiagen) and 
injected into one-cell stage embryos to generate transgenic driver lines. The follow- 
ing LexOP ‘responder lines were generated from multisite-gateway clones (Invi- 
trogen) using the Tol2kit**: (1) LexOP(p5E)/secGFP(pME)/fgf3(p3E), (2) LexOP 
(p5E)/secGFP(pME)/polyA(p3E) and (3) LexOP(p5E)/nlsGFP(pME)/polyA(p3E). 
All clones carry the cmlc2:eGFP ‘bleeding heart’ cassette as a transgenic marker”’. 
Fgf3 was tagged by inserting the GFP sequence in between signal peptide (sec) and 
globular domain of Fgf3, the strategy previously used for Fgf8 (ref. 19). secGFP has 
the signal peptide of Fgf3 protein fused to GFP. The LexPR system was proved to 
be non-leaky as there was no detectable target gene expression in the absence of 
activator RU486. It was also proved to be spatially restricted, showing transactiva- 
tion only in Cxcr4b- expressing tissues. The dose-response of the inducible LexPR 
system was evaluated using the transactivation levels of the lecOP:nlsGFP respon- 
der after treatment for 6 h with 0, 5, 10 and 20 uM RU486. The lateral line primor- 
dium was imaged with the same acquisition settings for all samples. The average 
fluorescence signal from mean projected images was quantified, and these values 
were first background subtracted (calculated from untreated embryos) and then 
normalized to [0,1] range by dividing with the maximum signal. 

The BAC fgf3:fgf3-GFP line was generated by replacing coding sequence of 

the first exon in Fgf3 BAC (CH211-96B20) with a targeting cassette: secGFP/fgf3/ 
SV40poly(A)/FRT-KanR-FRT. The KanR cassette was subsequently removed with 
FLP recombinase. The cry:CFP/ AmpR ‘crystal eye’ cassette was inserted into the 
BAC backboneas a transgenic marker’®. BAC recombination and purification steps 
were followed as described’*. Whole-embryo overview images were generated using 
a Zeiss Lightsheet Z.1 microscope (X20). The functionality of the BAC line was 
tested by FGF knockdown rescue experiments, where both Fgf3 and Fgfl0a genes 
were knocked down owing to mutual compensation of the two ligands in lateral line 
system’. Fef3 (splice site blocker, 5 ng nl ')** and Fef10 (start site blocker, 5 ng nl ')** 
morpholinos were injected into BAC fgf3:fgf3-GFP, cldnb:lynGFP embryos at the 
one-cell stage, where half of the embryos were BAC fgf3:fgf3-GFP transgene car- 
riers as detected by cry:eCFP transgenic marker (Extended Data Fig. 6). The cxcr4b: 
cxcr4b-tagRFP line was generated by inserting TagRFP cassette into the Cxcr4b BAC 
(CH211-145M5) as described”®. 
Chemical treatments. SU5402 (Calbiochem) was used for inhibition of Fgfr1 kinase 
activity. Eight embryos per 2 ml of E3 buffer were used as standard treatment den- 
sity. For organ deposition experiments, embryos were treated with 0.5 and 1 uM 
$U5402 in 0.1% dimethylsulphoxide (DMSO) starting at 24 h.p.f.; controls were 
treated with 0.1% DMSO alone. Time-lapse imaging was started at 4-6 h after treat- 
ment, when the first organ was about to be deposited. Drug efficacy was observed to 
decrease over time-lapse imaging owing to light sensitivity of SU5402. Therefore, 
organ spacing was quantified as 2 d.p.f. measurements on embryos that were kept 
in the dark. 

RU486 (Sigma) was used to transactivate LexPR/LexOP driven gene expression. 
For organ deposition experiments, cxcr4b:lexPR, lexOP:fgf3-GFP embryos were 
treated with 5, 10 and 20 1M RU486 starting at 24 h.p.f. Time-lapse imaging was 
started at 4-6 h after treatment. Asa control group, cxcr4b:LexPR transgenics with- 
out lexOP:fgf3-GFP were treated with 10 1M RU486. 

Brefeldin A (BFA, Sigma) was used to visualize localization of Fgf3—GFP in the 
absence of secretion. Embryos were first treated with 15 uM RU486 for 4-6h to 
express Fgf3—GFP and then treated with 14 1M BFA for 30 min to block secretion. 
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Immunofluorescence and colorimetric in situ hybridization. Monoclonal anti- 
Fgfrla antibody was generated using the following peptide, corresponding to 140- 
360 amino acids of Fgfrla protein as an antigen: ‘KLSNDQNLPMAPVWAQP 
DKMEKKLHAVPASKTVKFRCQANGNPTPTLKWLKNGKEFKRDQRIGGFK 
VREHMWTIIMESVVPSDRGNYTCLVENRHGSINHTYQLDVVERSPHRPILQ 
AGLPANRTAVVGSDVEFECKVFSDPQPHIQWLKHIEVNGSRYGPDGLPYV 
RALKTAGVNTTDKEMEVLQIRNVSLEDAGEYTCLAGNSIGHSHHSAWLTV 
YKA’. For whole mount antibody staining, embryos were fixed with pre-cooled 85% 
methanol, 15% acetic acid for 3 min at —20 °C and rehydrated with methanol series 
75, 50, 25%, 3 min each at room temperature (~23 °C). Blocking was done with 
blocking buffer (1X PBS, 1% DMSO, 2% NCS, 1% BSA, 0.1% Tween) for 4h at 
room temperature. Embryos were then incubated with primary antibody (1:50 in 
blocking buffer) for 20 h at 4 °C. Embryos were washed with blocking buffer four 
times for 30 min at room temperature and incubated with Alexa 488-anti-mouse 
antibody (1:500 in blocking buffer) for 2.5 h at room temperature. Embryos were 
then washed with blocking buffer four times for 30 min and mounted in 1% LM aga- 
rose. Samples were imaged using an Ultraview VoX spinning disk confocal micro- 
scope with a X63 Zeiss water objective (1.2 numerical aperture). 

ZO1 antibody staining (anti-ZO1 primary antibody, Alexa-568-coupled anti- 
mouse secondary antibody) and pea3 and fgf3 in situ hybridization (DIG probes, 
anti-DIG alkaline phosphatase coupled antibody, NBT/BCIP substrate at 30 °C) were 
performed as described previously”. 

Analysis of migration and organ patterning. Embryos were imaged with PE Ultra- 
view ERS and PE Ultraview VoX spinning disk microscopes using Zeiss 5, X10, 
X20 air objectives. Multi-position time-lapse images were acquired from 10 to 
30 min intervals and a computational pipeline to analyse migration and organ pat- 
terning was established. First, images of individual embryos were stitched auto- 
matically by a macro using Grid stitching tool in FIJI°*. To analyse time-lapse movies, 
kymographs (x-t graphs) were generated using FIJI. For each embryo, a segmented 
line region of interest (ROI) was drawn along the migration path of the primor- 
dium with a thickness that covered the lateral line primordium. The image beneath 
the line ROI was re-sliced (from xy-t to xt-y) and maximum projected. This way 
maximum signal intensity along the width of the tissue was represented in the kymo- 
graph for each time point. Images were then saved as text images to be automatically 
processed with an R script. Organ positions were determined with a peak detection 
algorithm implemented in R package ‘Peaks’*’. To map the trajectory of each organ, 
kymographs were sequentially processed in reverse order from the last time frame 
to the first. A wide range of parameters (threshold from 10 to 50 in increments of 
10, and sigma from 3 to 9 in increments of 3) was used for peak detection, since the 
signal intensity profiles change over time. We manually checked whether the peaks 
identified at the last frame referred to an organ or not; only those that did refer were 
retained as starting points for tracking. For the remaining time points, the hypo- 
thetical position of each organ was initially estimated on the basis of the average dis- 
placement of the last three time points, then the closest position was sequentially 
linked to the trajectory between consecutive time points (Supplementary Video 2). 
Velocity and acceleration profiles of organs were generated from the migration tra- 
jectories using local polynomial fitting and its derivatives (KernSmooth package in 
R)**. Organ deposition was defined as the time point where acceleration of the indi- 
vidual organ unit was minimum. We defined three potential parameters that influ- 
ence organ patterning: (1) embryo growth, (2) primordium migration velocity and 
(3) organ deposition timing. As higher growth rate between two organ depositions 
could hypothetically result in increased spacing, we evaluated the effect of embry- 
onic growth by generating trajectories of manually segmented myotome borders, as 
embryonic landmarks, from kymographs generated using transmission light images. 
Next, myotome trajectories were subtracted from lateral line organ trajectories using 
the closest myotome for each organ and each time point. Finally, ‘growth-subtracted’ 
organ positions were calculated, revealing that in the absence of embryonic growth, 
organ spacing would decrease overall without much effect on relative spacing. We 
next evaluated the effect of primordium migration velocity and organ deposition 
timing on organ spacing. If primordium velocity was higher between two organ de- 
positions, or the following organ was deposited later, the spacing between these 
organs would increase. Correlation of these two parameters with spacing revealed 
that organ deposition timing is the main determinant of the global organ pattern- 
ing in WT embryos. 

CLEM. SecGFP, nls-tdTomato-expressing embryos were live imaged (sagittal plane) 
with a confocal microscope using a X 10 objective for whole-embryo overviews, to 
aid tissue sectioning, and X63 objective for high-resolution imaging of lateral line 
organs, to aid three-dimensional CLEM image construction. After live imaging, 
embryos were removed from agarose, anaesthetized and tails were removed by cut- 
ting after the yolk extension. Bodies were immediately fixed with 2.5% glutaralde- 
hyde and 4% paraformaldehyde in 0.1 M PHEM buffer for 14 min ina Pelco BioWave 
microwave containing ColdSpot (100 W cycling intervals of 2 min on and off under 
vacuum). Further processing was performed as described", although using 0.1 M 
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PHEM buffer instead of cacodylate buffer. Samples were flat embedded between aclar 
sheets and polymerized at 60 °C for 48 h. Lateral line organs were targeted for fur- 
ther processing by overlaying whole-embryo overviews of live imaging and images 
of fixed-embedded samples (CLEM targeting approach as described elsewhere’*). 
Melanocytes were used as landmarks to correlate the two data sets and then as guides 
to laser etch the block surface with an Olympus Cell4R with UV Cutting. Serial 
sections were cut 70 nm thick along the dorsoventral axis of the embryo (transverse 
plane) and placed on a copper palladium slot grid, coated with 1% Formvar (Serva). 

Electron microscope imaging was performed on a CM120 Phillips electron micro- 
scope. Serial images were aligned with Adobe Photoshop and structures of interest 
were tracked manually in 3dmod”. Electron and fluorescence microscopy images 
were further processed in Imaris 7.6.4 (Bitplane) for three-dimensional image han- 
dling. Nucleus positions of the target organs in electron microscopy images (dark 
grey) and fluorescence images (nls-tdTomato) were compared using the oblique 
slicer tool in Imaris to identify the correct transversal sectioning angle. Fluorescence 
images were then re-sliced using the identified angle. Shrinkage of electron micro- 
scope samples was calculated by comparing three-dimensional tissue size in electron 
microscope images and fluorescence images, then fluorescence images were resized 
accordingly. The central slice of electron microscope images and the corresponding 
fluorescence image were overlaid as shown in Extended Data Fig. 4m. A three- 
dimensional CLEM image construction of an organ centre with segmented struc- 
tures is displayed in Supplementary Video 7. 

FRAP and FLIP. Photo-bleaching experiments were performed using an Ultraview 
VoX spinning disk microscope equipped with a photokinesis unit and Zeiss X63 
water objective. Position accuracy of the laser pulse was calibrated using green fluo- 
rescent slides before each experiment. Experiments were performed on middle con- 
focal planes of secGFP and Fgf3-GFP pools. In FLIP experiments, five pre-bleach 
images were acquired (0.018 s per frame), then a small region (spot ROI with 0.73 um 
diameter) was repetitively bleached (45 time points) and the sample imaged in bet- 
ween (0.3 s per frame). Images were analysed by measuring mean intensity over 
time of (1) bleached region, (2) total pool, (3) background and (4) multiple other 
regions within the pool. In FRAP experiments, five pre-bleach images were acquired 
(30 ms per frame), then a strip ROI on the edge of the pool was bleached once and 
post-bleach images were acquired (45 time points, 30 ms per frame). Images were 
analysed by measuring mean intensity over time of (1) bleached region, (2) total 
pool and (3) background. Next, the measurements were uploaded to easyFRAP to 
calculate half-time of recovery with full-scale normalization and double term fitting”. 
Small spot ROI bleaching in the centre of the pool could not be used for FRAP 
experiments as the redistribution of the protein was too fast to catch recovery curves. 
This fast distribution could also be seen by the FLIP experiments with a spot ROI 
bleaching. 

Secretory pathway analysis. In vitro synthesized messenger RNAs (mRNAs) 
(100 ng pl *) encoding GM130-tdTomato and KDEL peptide fused to mKate2 were 
injected into one-cell stage embryos to label the Golgi apparatus and endoplasmic 
reticulum, respectively. GM130-tdTomato signal was segmented in three-dimensions 
and used as a landmark for density profile plotting of secGFP and Fgf3-GFP inten- 
sities within each cell. 

Single-cell overexpression experiments. Fgf3—GFP mis-expressing cell clones were 
generated by cell transplantation, following established protocols. Donor cells from 
cxcr4b:lexPR, lexOP:lynREP, lexOP:Fgf3-GFP, cxcr4b:nls-tdTomato transgenic em- 
bryos were transplanted into cldnb::lynGFP transgenic embryos, allowing Fgf3- 
expressing clones to be marked with nuclear tdTomato in membrane GFP-labelled 
hosts. Cxcr4b:nls-tdTomato cells were transplanted into cldnb::lynGFP embryos as 
controls. Time-lapse imaging was performed and the effect of FGF mis-expression 
was analysed by comparing migration behaviour of organs with and without clones. 
Primordium velocity, organ spacing and organ deposition timing between two 
consecutive depositions were quantified. To correct for intrinsic variation, which is 
high among WT organ intervals (see Fig. 1a and Extended Data Fig. 1), calculated 
values for each interval were normalized to the mean of the corresponding interval 
from controls. 

To generate Fgf3-GFP-overexpressing clones by mosaic expression, the lexOP: 
Fgf3-GFP plasmid was injected into cxcr4b:lexPR, cxcr4b:nls-tdTomato transgenic 
embryos at the one-cell stage. The next day, Fgf3-GFP expression was observed in 
randomly positioned cells. Overview images of the lateral line were acquired at 
2 d.p.f. to analyse organ patterning; water injected embryos were used as controls. 
Organ spacing in Fgf3-GFP microlumina-positive and -negative organs was ana- 
lysed by normalizing each interval to control embryos as described for the trans- 
plantation experiment above. 
smFISH. smFISH probes (Custom Stellaris FISH probes, Biosearch Technologies) 
were designed to target pea3 mRNA (ENSDART00000013033). Forty-eight sequence- 
specific oligonucleotides (listed below) were conjugated to the fluorophores Cal Fluor 
590 (red) and Quasar 670 (far red). Embryos were fixed and permeabilized fol- 
lowing standard zebrafish in situ hybridization protocols. smFISH was performed 


following the procotol of ref. 41, with the exception that 5X SSC replaced 2X SSC in 
the hybridization buffer, and embryos were stained with DAPI for 15 min at 30 °C 
after probe removal. Embryos hybridized with Cal Fuor 590 conjugated probes were 
mounted in Aquamount (Polysciences). Embryos hybridized with Quasar 670 con- 
jugated probes were mounted in GLOX buffer (0.4% glucose, 10 mM TrisHCl (pH 8), 
2X SSC, 0.16 mg ml! glucose oxidase, 0.02 mg ml catalase in ddH20) and imaged 
immediately to prevent bleaching. Imaging was performed using a X 100 Zeiss oil 
objective (1.4 numerical aperture) and a PE Ultraview VoX spinning disk micro- 
scope with 0.07 jm pixel size and 0.2 jum z steps. For Cal Fluor 590 conjugated probes, 
561 nm excitation, 620(W60) emission, and for Quasar 670 conjugated probes, 
640 nm excitation, 705(W90) emission, were used. 

smFISH images were analysed in Imaris 7.6.4 (Bitplane). First, a volume of inter- 
est (surface object) was defined by manually tracking borders at multiple z slices 
considering membrane and nucleus labelling (‘contour surface’ tool). Nuclei were 
counted using the spot segmentation tool with 2.5 jum estimated diameter, then iden- 
tified nucleus points were manually corrected for missing or fused selections. The 
RNA signal was counted using the spot segmentation tool with region growing, local 
contrast algorithms and 0.4 1m estimated diameter. Identified spots were filtered 
to have at least 0.4 ym diameter in the z dimension (Supplementary Video 9). Posi- 
tions of nucleiand RNA spots were exported from Imaris to be processed further in 
R. Transcript count per cell was calculated by simply dividing the number of iden- 
tified transcripts by the number of nuclei. 

Transcript profiles along the posterior-anterior axis of the primordium were gen- 
erated by fitting a line (the first principal component) to the nucleus positions along 
the long axis of the primordium. Then, segmented transcript and nuclei positions 
were projected on this line and their ratio along the primordium was plotted with 
10 jum sliding window. Transcript distributions of a mosaic embryo organ were rep- 
resented by assigning the transcripts to the closest nucleus position in two dimensions. 

To test the validity of the pea3 smFISH protocol to be used as FGF signalling 
read-out, embryos were treated with 4 4M $U5402 FGF inhibitor and 15 uM FGF 
inducer for 6 h. Pea3 smFISH protocol was applied on four WT, four FGF-induced 
and four Fgfr-inhibited primordia (an average of 150, 143 and 195 cells per pri- 
mordium respectively), and transcript counts per cell were plotted. 

The relation between luminal FGF levels and pea3 transcription response was 
tested by inducing lexOP:Fgf3-GFP expression with 5 1M and 20 1M inducer for 
6h to generate a wide range of expression levels. Organ 1 of each embryo was imaged 
using an Ultraview VoX spinning disk confocal microscope and a X63 Zeiss water 
objective (1.2 numerical aperture) with the same imaging settings. Embryos were 
then fixed and processed individually for smFISH protocol to compare their lumi- 
nal Fgf3—-GFP intensity with pea3 transcript counts. 

Pea3 smFISH oligonucleotides: 1, AAGGAAGACGGACAGAGGCA; 2, CTG 
TGTTTTAATGAGCTCCA,; 3, CTTAACCGTTTGTGGTCATT; 4, CCATCCAT 
CTTATAATCCAT; 5, AGTATAAGGCACTTGCTGGT; 6, ATTTCCTTGCGA 
CCTATTAG; 7, TCAACAGTCTATTTAGGGGC; 8, ATGTATTTCCTTTTTGT 
CGC; 9, AAGAGGTCTTCAGATTCCTG; 10, CCTGAAGTTGGCTTAAATCC; 
11, GGAACTTGAGCTTCGGTGAG; 12, AACAAACTGCTCATCGCTGT; 13, 
CACTGAGTTCTCTGAGTGAA; 14, TTCTTAATCTTCACAGGCGG; 15, TAG 
CTGAAGCTTTGCTTGTG; 16, TCATAGGCACTGGCGTAAAG; 17, CTGGA 
CATGAGCTCTTAGAT; 18, TTGGGGGAATAATGCTGCAT; 19, TGAGGGT 
GGATTCATATACC; 20, CGGAAGGGAACCTGGAACTG; 21, AGAGTGTTG 
CCGATGGAAAGC; 22, TGCTGAGGAGGATAAGGCAA; 23, CCATGTACTCC 
TGCTTAAAG; 24, TCCTGTTTGACCATCATATG; 25, CAGGTTCGTAAGTG 
TAGTCG; 26, TGTGATGGTACATGGATGGG; 27, AAACATGTAGCCTTCA 
CTGT; 28, T@€GCACAACACGGGAATCAT; 29, TCACCTCACCTTCAAATT 
TC; 30, ACCTTCACGAAACACACTGC; 31, TAGTTGAAGTGAGCCACGAC; 
32, GAAGGGCAACCAAGAACTGC; 33, ATGCGATGAAGTGGGCATTG; 34, 
ATGAGTTTGAATTCCATGCC; 35, TTGTCATAGTTCATGGCTGG; 36, GTA 
ACGCAAAGAGCGACTCA; 37, TTTTGCATAATTCCCTTCTC; 38, AGGTTA 
TCAAAGCTTCTGGC; 39, CGCTGATTGTCGGGAAAAGC; 40, GTTGACGT 
AGCGCTCAAATT; 41, AAGAAACTCCCTCATCGAGG; 42, TACATGTAGC 
CTTTGGAGTA; 43, AAAGGAGAATGTCGGTGGCA; 44, GTGGTAAACTGG 
GATGGGAA; 45, ATACAAGAGGATGGGGTGGG; 46, GAATGCAGAGTCC 
CTAATGA; 47, AGATAGGCCTCAGAAGTGAG; 48, GCAATCTCTTGAACC 
ACAGT. 

Shroom3 knockdown. Shroom3a was knocked down using a previously published 
morpholino (5’-CCTAATAAATTGTTACCTGACTAAC-3’, Gene Tools, 4.2 pmol 
per embryo)™. Consistent with the published report, the effect of knockdown on 
apical constriction was observed to be transient. To measure Pea3 levels in the ab- 
sence of microluminal trapping, only primordia without visible apical constriction 
were processed further for smFISH analysis. 

Laser micropuncture. Micropuncture experiments were performed with a Zeiss 
LSM 780 NLO 2-Photon microscope with a Zeiss X 63 water objective (1.2 numer- 
ical aperture). SecGFP and Fgf3-GFP pools were focused, and a laser (two-photon 
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960 nm laser) pulse was applied on different regions of the pool. A minimal pulse 
size (ROI diameter) of 0.43 j1m was selected to allow lumen opening with unde- 
tectable damage to participating cells. Targeting single pulses at these settings to 
the middle section of the pool resulted in a one-time reduction in total fluorescence 
signal, which we term FLIP. Targeting identical pulses to the microluminal lattice 
caused micropuncture, as revealed by a characteristic decay caused by leaking of the 
GFP pool through time. For mean fluorescence intensity plots, five time points 
before the pulse and 20 or more time points after the pulse were acquired, at a rate 
of 1 s per frame. To perform analysis of target gene response (pea3) to micropunc- 
ture, embryos were treated sequentially over a 30 min session. After the laser sur- 
gery, embryos were incubated for 30 min, allowing each organ 30-60 min response 
time after micropuncture before fixation for smFISH analysis. As there was high 
intrinsic variability of transactivation using the LexPR system (Extended Data Fig. 2), 
this resulted in variability in target gene response (Fig. 3). Therefore, to allow direct 
comparison of transcript counts, values were normalized to the first organ of each 
embryo that was left unperturbed. In control embryos, there is a clear trend where 
more mature organs have higher expression of pea3, presumably because of longer 
or higher exposure to microluminal FGF. Thus, unperturbed first and third organs 
provided internal controls. Laser micropuncture causes the second organ to have 
significantly lower transcript counts than the less mature third organ, a result never 
observed in non-micropunctured controls. 

The effect of micropuncture on FGF signalling was further investigated by per- 
forming smFISH protocol immediately after (f < 2 min), 1 h after and 4h after micro- 
puncture. The transcript count per cell of the punctured organ 2 was normalized 
internally to the unperturbed organ 3. 

To test the effect of microluminal FGF loss after micropuncture on collective cell 

behaviour, different organs before their depositions were micropunctured and the 
end-point spacing of the corresponding organ was compared with the unperturbed 
siblings. For Fgf3-GFP overexpression experiments, embryos were induced with 
20 uM inducer at 24 h.p.f. At 28 h.p.f., embryos with similar Fgf3-GFP overexpres- 
sion levels were pre-selected to eliminate sample variability due to drug induction. 
Organ 2 or 3 was punctured while organ 1 or organ 2 was being deposited, respec- 
tively. For secGFP expression experiments, the same strategy was followed without 
pre-screening for expression levels as secGFP is a neutral marker to visualize intact 
luminal space. 
Statistical analysis. All statistical analysis used R. A non-parametric Wilcoxon rank- 
sum test (two sided) was used to compare two groups. Sample sizes (1) and P-values 
(P) for each experiment are indicated in figure legends and exact P values are listed 
below. The statistical dependence between two variables was assessed with Spear- 
man’s rank correlation coefficient. 

In experiments of organ patterning, more than 50 samples were acquired and 
automatically analysed to ensure adequate statistical power. For the experiments of 
transplantation, micropuncture and smFISH analysis, the number of samples was 
mainly constrained by the complexity of the experimental procedure, data acquisi- 
tion and analysis capacity. In each experiment, data were analysed after the com- 
pletion of data collection. To prevent selection bias, samples of stage and genotype 
matched pools were randomly divided into experimental groups. Additionally, data 
were analysed automatically where possible to avoid subjective assessments (for 
example, analysis of organ patterning and smFISH transcript counts). 

Boxplots are standard box and whisker plots showing median and interquartile 
range. For all the experiments, original data points were displayed as scatter plots 
on top of boxplots using the beeswarm package”, allowing direct examination of 
the variance and distribution of the samples. 
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Pvalues and sample sizes. Fig. 1b: Spearman 1” = 0.77, N = 82, n = 260. Fig, 1c: 
Netxt = 82, Nos = 114, m = 104, Peutos = 3.94 X 10-4, Pos.1 = 6.99 X 10°”. Fig. 1d: 
Netr| = 64; Ns = 57, Nyo = 65, Nog = 58, Pers = 3.87 X 10°, Ps_19 = 3.71 X 10 4, 
Pio 29 = 1.36 X 10°. Fig. 3d: Nota = 38, Nmosaic = 114, next = 141, ttneg = 170, 
Npos = 240, Ptri-neg = 0.9203, Pout-pos = 1.29003 X 10° *8, Paeg_pos = 3.80489 X 10° *. 
Fig. 3e: Nur =4, Ninducer =4, Ninhibitor =4, Pwr-inducer = 0.0285, Pwr-inhibitor = 
0.0285. Fig. 3f: Spearman r =0.85, N= 18. Fig. 3g: Nei = 4; Mneg = 95 Npos = 7, 
Petrt-neg = 0.6042, Prog pos = 0.0001748. Fig. 4b: Nwr = 6, Nehroommo = 6, P= 
0.00216. Fig. 4e: Neti = 6, Npuncture = 16, P = 0.0003217. Fig. 4f: second organ: 
Nett = 16, Npuncture = 23, P = 1.443 X 10” ®; third organ: Newt = 7s Npuncture = 75 
P= 0.002331. Fig. 4g: Neosed = 6 Nopen = 6, P = 0.002165. Extended Data Fig. 1: 
Spearman N = 82, n = 260, x-t r = 0.77,x-vr° = 0.25, v-tr? = —0.07. Extended 
Data Fig. 2b: Ns = 44, Nig = 34, Ngo = 32, Ps_19 = 2.53 X 10°, Pyo-29 = 4.03 X 
1078. Extended Data Fig. 2d: net = 78, 005 = 71,11 = 74, Pet.os = 6.22 X 1074, 
Pos. = 7.26 X 10 +. Extended Data Fig. 2e: Ney = 109, ns = 112, m9 = 119, no 

137, Petts = 1.33 X 10° 1°, Ps_19 = 7.06 X 10 +, Pyo-29 = 2.46 X 10 +. Extended 
Data Fig. 2f spacing: New = 49, Nmut = 39 P= 7.69 X 10°"; timing: Noa = 31, 
Nmut = 28, P= 4.64 10~''. Extended Data Fig. 6d: Nwr=9, Nrescue = 13, 
Necemo = 14; Pwr-rescue = 0.09; Prescue-EGEmo = 1.751 X 10~®. Extended Data Fig. 7b: 
Neontrol +7; Niransplants 8; Ncontrol = 25; Meg 17, Npos 13; spacing: Potel_neg =, 
0.24,P., = 1.43 x 10°, Preg-pos = 4.40 X 10° *; timing: Pat-neg = 0.07; Petsl-pos = 
1.23 X 10°, Preg-pos = 4.09 X 10°. Extended Data Fig. 9d: Noh puncture = 6 
Non control = 9; Nin puncture — 6, Nin control = 9» Nan puncture — 6, Nah control = 95 
Pop = 0.7922, P; y = 0.0043, Py, = 0.4286. Extended Data Fig. 9e: Net second organ = 


22,N, puncture second organ — 23, Neu third organ — 8,N, puncture third organ — 9, P. second organ — 
6.928 X 10° °, Pthird organ = 0.0061. 
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Extended Data Figure 1 | Quantitative analysis of lateral line organ 
deposition. a, Posterior lateral line organs at 2 d.p.f. (cldnb:lynGFP). Organ 
positions were identified from intensity profiles using peakFinder_R. b, Density 
profile of distance between consecutive organ positions (first, second, third and 
fourth spacing interval; see Fig. 1a). c, List of potential parameters affecting 
organ spacing. d, cldnb:lynGFP and brightfield overlay image. Spheres indicate 
colour code representing individual organs used in further analysis. e, Upper 
panel, kymograph (x-t graph) from a 17.6 h time-lapse movie, where the y axis 
represents time and the x axis represents distance. Lower panel, segmented 
kymograph of primordium migration (green) and myotome growth (dashed 
lines) through time. f, g, Calculated position (f) and velocity (g) of each organ 


through time. Asterisk shows the time point when organ disengages from the 
migrating collective. h, Second organ acceleration through time. Organ 
deposition is defined as the time where acceleration is minimum. i, Growth- 
effect-subtracted velocity of each organ through time (solid lines) versus 
observed velocities (dashed line). j, Reconstruction of organ positions from 
growth-subtracted velocities. k, Comparing spacing, average velocity and time 
between consecutive depositions for first, second, third and fourth interval 
(normalized to maximum). |, Correlation of time, distance and average 
velocities between consecutive depositions. Statistics: Spearman N = 82, 

n = 260, x-t 7° = 0.77, x-v r = 0.25, v-t ° = —0.07. Scale bars, 500 um 

(a), 200 um, 5h (d, e). 
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Extended Data Figure 2 | ‘Tunable’ drug-inducible gene expression with 
LexPR and quantification of dose-dependent response to FGF signalling. 
a, Schema shows transactivator LexPR expressed under the control of CKCR4b 
promoter (Cxcr4b:LexPR) driving expression of LexOP-coupled coding 
sequences upon addition of inducer RU486 (above). Image of cxcr4b:LexPR- 
driven expression of lexOP:nlsGFP showing spatially restricted expression 
upon RU486 treatment. Scale bar, 500 jum (cry:eCFP ‘crystal eye’ marker: 
orange arrow; clmc2:GFP ‘bleeding heart’ marker: white arrow). b, Mean 
fluorescence intensity projection of Cxcr4b:LexPR, LexOP:nlsGFP primordium 
treated with 5 (n = 44), 10 (n = 34) and 20 uM (n = 32) RU486. Scale bar, 
50 um. Plot shows quantification of signal intensity after 4h of RU486 
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induction (Ps_19 = 2.53 X 10°, Pyo-29 = 4.03 X 107°). c, Colorimetric in situ 
hybridization of fgf3 mRNA in Cxcr4b:LexPR, LexOP:Fgf3-GFP showing 
uniform expression. Scale bar, 50 jm. d, e, Organ spacing in FGF inhibitor- and 
inducer-treated embryos at 2 d.p.f. d, Quantification of organ spacing (n = 78, 
71, 74, Poios = 6.22 X 1074, Pos_, = 7.26 X 10 *) in $U5402-treated 
samples. e, Quantification of organ spacing (n = 109, 112, 119, 137, 

Pots = 1.33 X 107), Ps_ 9 = 7.06 X 104, Pig 29 = 2.46 X 10°“) in RU486- 
treated samples. f, Organ depositions in WT and homozygous fgfrla*7°°"" 
mutants shown by kymographs of 21h time-lapse movies. Quantification of 
spacing (n = 49, 39, P= 7.69 X 10") and deposition timing (n = 31, 28, 
P=4.64 X 10 1) between organs (first interval). Scale bar, 200 jum, 5h. 
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Extended Data Figure 3 | Organ deposition and rosette formation rate (left) and total number of organ progenitor rosettes assembled (right) through 
upon Fgf3-GFP overexpression. a, cldnb:lynGFP embryos showing time in control (blue) and lexOP:fgf3-GFP (red) embryos. Only organ 
comparison of organ deposition and rosette formation rate upon lexOP:fgf3- deposition timing shows a clear difference between these conditions. c, Plots 
GFP overexpression. b, c, Comparisons of total number of organs deposited showing multiple examples of data in b (n = 7, 7). 
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Extended Data Figure 4 | SecGFP and Fgf3-GFP localization in apically 
polarized secretory path. a, Golgi, labelled by GM130-tdTomato (white) 
mRNA injection, are localized apically around rosette centres in lateral line 
primordium (cldnb:lynGFP, red). Scale bar, 20 um. b, Maximum projection 
of apical optical sections of a transgenic lexOP:fgf3-GFP primordium, 
counterstained for ZO1, shows intracellular Fgf3-GFP signal around rosette 
centres in addition to luminal signal. Scale bar, 50 jm. ¢, Single cell expressing 
Fgf3-GFP feeds the central microlumen through apical secretion (expressing 
cell indicated with yellow dashed line). Scale bar, 5 jm. d, Mosaic primordium 
showing apically localized intracellular Fgf3—-GFP signal co-localizes with Golgi 
marker GM130-tdTomato. Scale bar, 5 um. e, Intracellular Fgf3-GFP and 


secGFP localization at secretory path. f, Golgi (GM130-tdTomato) co-labelling 
with secGFP. Scale bar, 5 jm. g, Endoplasmic reticulum (mKate2-KDEL) co- 
labelling with secGFP. Scale bar, 5 um. h, Signal distribution of secGFP and 
Fgf3-GFP in three dimensions within the expressing cell where Golgi was taken 
as a central point. Comparison of Fgf3-GFP and secGFP density profiles 
suggests that Fgf3-GFP is more pronounced in Golgi (Meccrp = 5; 

Ngge3-GEp = 4). i, Imaging of Fgf3-GFP-expressing clones (white dashed lines) 
with high sensitivity reveals Golgi localization (yellow arrowheads) of Fgf3- 
GFP in expressing cells close to the microlumen (asterisk) and intracellular 
vesicles in connected non-expressing cells (white arrowheads). No extracellular 
signal besides microluminal accumulation was detected. Scale bar, 5 um. 
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Extended Data Figure 5 | CLEM analysis of microlumen structure and 
FLIP/FRAP analysis of microluminal pools. a, Overview of lexOP:secGFP; 
cxcr4b:nls-tdTomato embryo used for CLEM; two organs and migrating 
primordium were targeted for further processing. Scale bar, 200 jum. b, Re- 
sliced middle section of targeted organ centres, overlay of secGFP signal with 
corresponding EM slice (scale bar, 5 jum) and close-up view of microlumina. 
c, Close-up view of luminal cavity (green) distorted by kinocilium (blue). 

d, Traced tight junctions (red) and adherens junctions (orange) at three 


cross-sections of microlumen. e, Setup of FLIP experiment on Fgf3-GFP and 
secGFP pool highlighting repetitively bleached region (0.73 jum diameter, 

red circle) and regions used for total pool (green circle), background (grey box) 
and readout (blue circles) measurements. Plots show mean intensity of 
described ROIs over time. f, FRAP experiment on secGFP and Fgf3-GFP pools 
with a strip ROI. Mean normalized recovery curves (mean + s.d., N = 7) 

and calculated half time of recovery. Arrow indicates start of bleaching. 
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Extended Data Figure 6 | BAC fgf3:fgf3-GFP rescues FGF loss of function _ by crystal eye transgene marker (yellow star). Scale bar, 200 tum (b). 

in lateral line. a, BAC fgf3:fgf3-GFP line showing expression in known Fgf3 _—_d, Quantification of rescue by comparing organ counts of WT, fgf3:fgf3-GFP 
expression domains (28 h.p.f.). Scale bar, 200 jum. b, Loss-of-organ deposition with Fgf3/10 MO background and Fgf3/10a MO alone at 2 d.p.f. 

phenotype Fgf3/10a morphant embryos (Fgf3/10a MO, upper) is rescued by (Nwr = 95 Nrescue = 13, Neges/10a_mo = 14, Pwr-rescue = 0.09, 

BAC fgef3:fgf3-GFP transgene (lower). c, Low-magnification image showing Prescue-Fef3/10a_MO = 1.751 X 10 °). 

Fgf3/10 morphants, with BAC fgf3:fgf3-GFP rescued siblings, distinguished 
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Extended Data Figure 7 | FGF signalling range is restricted to individual 
organ progenitors. a, Kymographs of mosaic Fgf3-GFP expression generated 
via cell transplantation. lexOP:fgf3-GFP/cxcr4b:nls-tdTomato-expressing 
clones (green) in the cldnb:lynGFP line (red) cause rapid arrest of migration. 
The phenotype only becomes apparent when the organ reaches tissue rear. 
(Colour code: organs with ectopic FGF source in green; organs without ectopic 
FGF source in red; organs of control transplants in blue.) Scale bars, 200 um, 
5h. b, Quantification of spacing and deposition timing of organs from mosaic 
Fgf3-GFP transplants, normalized by mean values of control embryos for 
each interval (Neontrol =7, Meransplants =8, Ncontrol 25, Meg 17, Npos 13; 
spacing: Port-neg = 0.24, Peta-pos = 1.43 X 10°, Paeg-pos = 4.40 X 10”; 
timing: Pcta-neg = 0.07; Petst-pos = 1.23 X 10°, Preg-pos = 4.09 X 10° *). 
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c, Close-up view of Fgf3-GFP (green)/nls- tdTomato- (red) expressing 
clones in cldnb:lynGFP- (green) expressing organ, showing cells in different 
positions feed in the central microlumen. Scale bar, 5 um. d, Tracking of WT 
transplanted cells (nuclei marked with grey dots and numbered) relative to 
organ centres in cldnb:lynGFP primordium (red). Yellow circles represent 
each organ unit. Middle panel: calculated velocities for each tracked nucleus 
(grey lines) and organ centres (green lines) reveal that migration of individual 
cells is in synchrony with the belonged organ unit independent of their position. 
Right panel: distance between consecutive tracked cells at the beginning 

and end of the time-lapse movie shows that initial distance is not a reliable 
indicator of final cell positions. 
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Extended Data Figure 8 | smFISH analysis of FGF target-gene regulation. 
a, Pea3 smFISH on WT, 15 LM FGF inducer- and 4 uM FGF inhibitor-treated 
primordia (cldnb:lynGFP in green, DAPI staining in blue, pea3 mRNAs in 
white), Scale bar, 5 jm. Close-up view of the dashed boxes shown as raw image 
(middle) and segmented pea3 transcripts (right). Scale bar, 2 um. b, Image of 
pea3 smFISH in WT primordium (above); profile plot shows pea3 transcripts 
per cell over distance from leading edge (below). c, Pea3 smFISH in an 

organ with single Fgf3-GFP-expressing cell. Number of pea3 transcripts 
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assigned to each nucleus does not show increase towards the expressing cell. 
Scale bar, 5 um. d, Colorimetric in situ hybridization of pea3 mRNA showing 
high expression levels upon Fgf3-GFP induction, visible after 1h colour 
reaction (30 °C), whereas expression in WT is hardly detectable. However, 
increasing reaction time reveals pea3 mRNA signal in WT primordia. 

e, Colorimetric in situ hybridization (30 °C, 0.5h) of pea3 RNA in mosaic 
Fgf3-GFP expression shows detectable pea3 only in the expressing organ. 
Scale bar, 100 um. 
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Extended Data Figure 9 | Characterization of luminal integrity and 
function upon mechanical and genetic perturbation. a, Plot of secGFP pool 
fluorescence intensity upon micropuncture (green). Kymographs show the 
time-lapse imaging of the secGFP pool used for the plot. b, c, Luminal 
Fgf3-GFP signal recovery of whole pool bleached (left) and micro-punctured 
(right) organs during 48 min of acquisition. Kymographs show time-lapse 
imaging of Fgf3-GFP pool. Single time points of time-lapse imaging after 
micro-puncture (right). Scale bar, 5 im. d, Quantification of pea3 transcript 
levels at t= 0h, 1h and 4h after micropuncture of organ 2 expressing 
lexOP:fgf3-GFP. Unperturbed organ 3 was used for normalization. 
Comparison of control and punctured organs suggests that pea3 levels are 


Control Puncture 


Organ 2 Organ 3 


normal immediately after puncture, are reduced 1h later and recovered by 
4h (Non puncture — 6, Non control — 5; N, h puncture — 6, N, h control = 5, 

Nah puncture = 6, Nah control = 5; Pon = 0.7922, Pin = 0.0043, Pan = 0.4286). 
e, Organ deposition delay upon lumina micropuncture of secGFP-expressing 
second and third organs (Netri second organ — 22, Npuncture second organ — 23, 
Neu third organ — 8, Nouncture third organ — 9, Psecond organ — 6.928 X 10°, 

Pinird organ = 0.0061). Scale bar, 200 um. f, g, Shroom3 morphant primordia 
show intervals with no or delayed deposition. f, Kymographs of shroom3 MO 
and control. Scale bars, 200 um, 5h. g, Organ pattern in shroom3 MO and 
control at 2 d.p.f. Scale bar, 200 um. 
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Extended Data Figure 10 | Loss of microlumen pool upon fusion to nls-tdTomato-expressing embryo. SecGFP signal disappears as microlumen 
overlying skin. a, Microlumen of maturing organs fuses with the skin andthe _ opens (arrowheads in kymograph show opening of microlumina). Scale bars, 
diffusible content (Fgf3-GFP in green) disappears. Tight junctions marking 200 pum, 5h. c, Side view of a maturing organ with kinocilia protruding out 
microlumen and skin borders are revealed by ZO1 immunofluorescence (red). _ of the organ (cldnb:lynGFP in green, central cell atohla:tdTomato in red). 
Cartoon displaying the sequence of events (right). Scale bar, 5 jim. Scale bar, 5 um. 

b, Kymograph and single time-points from time-lapse imaging of secGFP, 


©2014 Macmillan Publishers Limited. All rights reserved 


Nae I Tea 


doi:10.1038/nature13663 


PLETHORA gradient formation mechanism 


separates auxin r esponses 


Ari Pekka Mah6nen!*#*, Kirsten ten Tusscher*, Riccardo Siligato’’, Ondrej Smetana’?, Sara Diaz-Trivifio”®, Jarkko Salojiarvi’, 
Guy Wachsman’, Kalika Prasad’, Renze Heidstra”* & Ben Scheres”” 


During plant growth, dividing cells in meristems must coordinate 
transitions from division to expansion and differentiation, thus gen- 
erating three distinct developmental zones: the meristem, elonga- 
tion zone and differentiation zone’. Simultaneously, plants display 
tropisms, rapid adjustments of their direction of growth to adapt to 
environmental conditions. It is unclear how stable zonation is main- 
tained during transient adjustments in growth direction. In Arabi- 
dopsis roots, many aspects of zonation are controlled by the phyto- 
hormone auxin and auxin-induced PLETHORA (PLT) transcription 
factors, both of which display a graded distribution with a maximum 
near the root tip?’. In addition, auxin is also pivotal for tropic 
responses’**. Here, using an iterative experimental and computa- 
tional approach, we show how an interplay between auxin and PLTs 
controls zonation and gravitropism. We find that the PLT gradient 
is not a direct, proportionate readout of the auxin gradient. Rather, 
prolonged high auxin levels generate a narrow PLT transcription 
domain from which a gradient of PLT protein is subsequently gen- 
erated through slow growth dilution and cell-to-cell movement. The 
resulting PLT levels define the location of developmental zones. In 
addition to slowly promoting PLT transcription, auxin also rapidly 
influences division, expansion and differentiation rates. We demon- 
strate how this specific regulatory design in which auxin cooperates 
with PLTs through different mechanisms and on different timescales 
enables both the fast tropic environmental responses and stable zo- 
nation dynamics necessary for coordinated cell differentiation. 

We have previously shown that four PLT transcription factors with 
graded distribution (PLT1, PLT2, PLT3 and BBM (also known as PLT4)) 
are necessary for stem cell maintenance and cell division in the root*”. 
Furthermore, correlation of PLT protein levels with the developmental 
transitions that define root zonation (Fig. la) suggests a dosage-dependent 
control by PLTs’. However, two issues remain unresolved. 

First, the precise relationship between PLT dosage and the location 
and size of the stem cell domain has not been established. Therefore, 
we investigated whether different PLT levels mediate the distinction 
between slowly dividing stem cells and fast dividing transit amplifying 
cells. The addition of extra copies of PLT2 led to an enlarged meristem 
and shootward shift of the high-division-rate domain (Fig. 1b, c and 
Extended Data Fig. 1a, b), indicating that the highest dose of PLT2 slows 
down division rates as observed in the stem cell niche, while medium 
levels trigger high division rates shootward from the stem cell region. 

Second, it remained to be established whether, similar to stem cell fac- 
tors in the animal kingdom, PLT transcription factors repress differenti- 
ation. In that case, expression of PLT2 in one cell type should be sufficient 
to block differentiation locally while allowing differentiation of other cell 
types. To test this, we induced yellow fluorescent protein (YFP)-tagged 
PLT2 using either a protoxylem and the associated pericycle-specific pro- 
moter pAHP6 (ref. 15) or an epidermal/lateral root cap promoter pWER"*. 
pAHP6:X VE>PLT2- YFP induction inhibited protoxylem differentiation 


and caused local ectopic cell proliferation while root hair differenti- 
ation proceeded normally. Reciprocally, pWER:X VE>>PLT2- YFP induc- 
tion triggered local inhibition of root hair differentiation and ectopic cell 
division while protoxylem differentiation proceeded normally (Fig. 1d 
and Extended Data Fig. 1c). Furthermore, induction of PLT2 inhibited 
cell expansion, which is generally considered to be an early step in cell 
differentiation. The speed at which PLTs control expansion suggests that 
the decline in PLT levels along the gradient determines the transition to 
differentiation (Supplementary Notes and Extended Data Fig. 1d, e). 
Finally, we tested whether this differentiation threshold was imposed 
also by physiologically relevant PLT concentrations. Reduction of PLT2 
by inducible RNA interference (RNAi) in the p/t1,3,4 mutant, which 
solely depends on PLT2 to form functional meristems’, indeed triggered 
meristem cell expansion and differentiation (Extended Data Fig. 1f). 
Taken together, our results show that the PLT protein gradient shape 
defines the location of at least two boundaries: the boundary between 
slowly and rapidly cycling cells, and the shootward boundary of the 
meristem. 


CYCB1;1-GFP x 
pPLT2:PLT2-YFP 


Figure 1 | PLT levels define zonation boundaries. a, Zonation of 4-day-old 
wild-type root. Arrows and arrowheads indicate youngest protoxylem cell. 
The meristem (M), expansion (EZ) and differentiation (DZ) zones are 
highlighted. b, c, Frequent cell division, monitored by the G2/M-phase cell 
cycle marker CYCB1;1-GFP, occurs close to the quiescent centre (arrow) in 
wild-type meristem (b). This domain shifts shootward with increased PLT2 
dosage (that is, homozygote pPLT2:PLT2-YFP in Col background; green and 
green/yellow channels shown) (c). d, Twenty-four hours induction of PLT2- 
YFP in the vascular tissue (left) locally inhibits xylem differentiation (arrow, 
first xylem element), while PLT2-YFP induction in epidermis (right) inhibits 
root hair formation (arrowhead, first root hair). Propidium iodide highlights 
cell wall and protoxylem in b-d. Scale bars, 50 um. 
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PLT gradients have been considered to be generated at the transcrip- 
tional level, based on the similarity of transcriptional and translational 
PLT-fluorescent protein fusion gradients’. PLT transcription requires 
auxin response factors (ARFs)*"”, and since auxin is also present in a 
graded pattern*”, it was postulated that the PLT gradient may bea read- 
out of the auxin gradient. To study in detail how PLT protein gradients 
are defined, we first investigated the timescale and levels at which PLT 
expression is controlled by auxin. Prolonged auxin (indole acetic acid 
(IAA)) treatment rapidly induced the auxin response marker DR5:GFP”, 
especially when combined with the auxin transport inhibitor, 1-N- 
naphthylphthalamic acid (NPA), but the expression domain of PLTs 
failed to expand rapidly (Fig. 2a, b and Extended Data Fig. 2a—c). Only 
after prolonged IAA plus NPA treatment (24-72 h) did expression of 
PLT-YFPs and the quiescent centre stem cell organizer marker pWOXS: 
GFP shift shootward, mostly in the meristematic ground tissue (Fig. 2b 
and Extended Data Fig. 2a—c). This was associated with morphological 
changes, suggesting that the new PLT expression domain correlated 
with cell fate changes similar to those described for prolonged NPA 
treatment*. Our experiments thus indicated that PLT induction requires 
prolonged high auxin levels. To test the implications of these findings, 
we developed a simulation model of root zonation. The model incor- 
porates a description of root tissue architecture, a generalized PLT-ARF 
gene regulatory network, root PIN-FORMED (PIN) protein patterns 
governing auxin transport, and cell growth, division, expansion and 
differentiation. The resulting model (‘initial’ model; see Supplementary 


DR5:erGFP 


Sh 24h 


PLT2-YFP 


Max level 


Min level 
Aux: log(0.05) — log(3,500) a.u. 


PLT: 0-70 a.u. 


10/10 A7/N7 


Figure 2 | The PLT2 gradient is not a fast readout of the auxin gradient. 

a, b, Four-day-old seedlings transferred to agar plates containing 20 1M NPA 
plus 5 uM TAA for the indicated times. Auxin response reporter DR5:erGFP 
(a) rapidly responds to treatment whereas pPLT2:PLT2-YFP (b) accumulates 
later (white arrow), associated with repatterning (inset in b, magnified image of 
bracketed region with altered cell division planes at arrowheads). Black 
arrow indicates fluorescent region after NPA plus IAA treatment (a), but not 
after [AA treatment (Extended Data Fig. 2b). Observed phenotypes/number of 
roots analysed is indicated in the right bottom corners. c, Failure of PLT 
gradient formation in the initial model. Snapshots of auxin (Aux), PLT 
transcription (PLT) and PLT protein (PLT) profiles under steady-state root 
growth dynamics are displayed. a.u., arbitrary units. Scale bars, 50 jum. 
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Notes, Supplementary Methods and Extended Data Fig. 3) predicts a 
PLT gradient with shorter range due to its dependence on high auxin 
levels, in disagreement with experimental observations (Fig. 2c, Sup- 
plementary Video 1 and Extended Data Fig. 4). Moreover, aux1,ein2,gnom 
triple mutants, which display a more shallow auxin gradient along the 
root tip as inferred from direct auxin and auxin response measurements’®, 
nevertheless possess a normal range PLT2-YFP gradient (Extended Data 
Fig. 2d). Together, this demonstrates that the PLT protein gradient is 
not a direct readout of the auxin gradient. 

We investigated how the experimentally observed long PLT protein 
gradient could arise despite the narrow, non-graded expression domain 
predicted by our model. One potential explanation emerged when we 
noticed that PLT2-YFP expression in pAHP6:X VE>>PLT2-YFP lines 
did not only appear in the narrow AHP6 transcription domain (erGFP 
(where erGFP is a variant of GFP localized to the endoplasmic reticulum) 
in Fig. 3a), but also in the neighbouring cells (PLT2-YFP in Fig. 3a), 
suggesting that the protein might influence gradient shape by acting 
as a mobile plant transcription factor (for a review of this topic, see 
ref. 19). To ascertain this, we introduced red fluorescent protein (RFP)- 
tagged PLT2 into a clonal activation system” and generated small clones 
of PLT2-RFP-expressing cells in the meristem. After induction, PLT2- 
RFP not only resided in clones (marked with green fluorescence) but 
also in 1-2 cells surrounding the clones (Fig. 3b). When the clones entered 
the elongation zone, the cells in the clone and the adjacent PLT2-RFP 
cells remained meristematic and failed to expand (n = 7), while cells 
shootward and rootward from the clone ceased cell division and ex- 
panded (Extended Data Fig. 5a—e and Fig. 3b). These data demonstrate 
that either PLT2 protein or PLT2 transcript moves to the adjacent cells, 
yielding translocated functional PLT2-RFP. In addition, the clonal data 
demonstrate that the inhibition of cell expansion is not the result of a 
community effect, in which cells in a larger longitudinal region collec- 
tively determine whether to expand, but an effect of local PLT levels 
within the cell file. Fusion of three copies of YFP to PLT2 significantly 
constrained intercellular movement (Fig. 3a), and when PLT2-3 * YFP 
was expressed under the PLT2 promoter it complemented the stem cell 
defect of pit1,2, but led to a shorter meristem than when PLT2-YFP 
was used, indicating that PLT cell-to-cell movement contributes to mer- 
istem size (Supplementary Notes and Extended Data Fig. 5f-h). 

We next performed simulations to analyse how cell-to-cell movement 
contributed to the PLT gradient. We first simulated PLT movement in 
the absence of growth and found that, for effective movement, PLT pro- 
teins needed to have slow turnover dynamics (Fig. 3c, simulated half- 
life of ~ 16 h; see Supplementary Notes and Supplementary Methods). 
Next, we reinstated root growth. Interestingly, the model predicted that 
slow PLT turnover in itself substantially contributes to the spread of 
PLT protein through growth dilution (Fig. 3d). 

These new data and previous findings’ about a regulator of PLT 
stability highlight a role for protein stability in gradient formation. The 
previously reported similarity between translational and transcriptional 
reporter fusion gradients’ may therefore rather be explained by similar 
stability of the PLT proteins fused to fluorescence reporters or reporters 
on their own. To test the influence of protein stability on gradient for- 
mation, we used stable and labile proteins fused to the YFP reporter. 
Histone 2B (H2B), a component of nucleosomes, was used as a stabi- 
lizing protein tag, while CYCB1;1, which is degraded from anaphase to 
S phase” in rapidly dividing meristem cells, was employed as a labile 
protein tag. When driven by the PLT2 promoter, H2B-YFP displayed 
fluorescence well into the differentiation zone with a shallow gradient, 
whereas CYCB1;1-YFP was only present in a punctuate pattern close 
to the stem cell niche (Fig. 3e). Our data imply that PLT genes are tran- 
scribed proximal to the stem cell niche, in line with our model predic- 
tions, and that retention of PLT proteins in more shootward cells 
depends critically on their stability. By crossing pPLT2:CYCB1;1-RFP 
with pPLT2:PLT2- YFP, we estimated that the PLT2 transcription domain 
encompasses approximately one-third of the visible PLT2 protein gra- 
dient (Fig. 3f). A subset of the cells in the remaining two-thirds of the PLT2 


©2014 Macmillan Publishers Limited. All rights reserved 


PAHP6:XVE 
>>PLT2-YFP 


PAHP6:XVE 
>>PLT2-3xYFP. 


T 1 T 
— Fast turnover, no movement 


| 
a 
= '60P — Fast turnover, movement 
bear Q —— Slow turnover, movement 
3 
72 
=e 
2s 40 
o 2 
as 
= 
5 2 
Oo 
2s 20 
oe 
Bn 
= 
oe 
oa 
3 0 
i= 0 300 600 900 


Distance from root tip (tum) 


Q 


1 T T 
— Fast turnover, no movement 
== Slow turnover, no movement 


a 
Oo 
T 


e 
& 
ie: g == Slow turnover, movement 
= Oo ze me 
Be 60 
> 2 
@ 2 
3.2 40 40 
<5 
oe 
of 
a, 20 150 300 450 
aa 
OES 
a 
[o) 
2 0 
iS 
ae 0 300 600 900 


Distance from root tip (tum) 


Number of cells 


0+ 
Ctrl 


Figure 3 | Gradient formation by PLT2 cell-to-cell movement and mitotic 
segregation. a, AHP6 promoter-erGFP fusion is consistently active in two 
vascular strands (and occasionally in columella). PLT2-YFP driven under 
inducible AHP6 promoter spreads from its transcription domain, especially in 
the stem cell region, whereas the movement-deficient version, PLT2-3X YFP, is 
predominantly confined to the AHP6 transcription domain, although weak 
signal resides in the stem cell region and between the two vascular strands. 
b, PLT2-RFP moves from clone (marked with GFP) to neighbouring cells. 
Arrowheads indicate recently divided nuclei. c, d, Influence of PLT cell-to-cell 
movement and turnover dynamics on vascular PLT protein profiles (main 
graph) and transcription profiles (inset) in the initial model in the absence 


gradient underwent mitosis, indicating that cells containing PLT2 protein 
but not transcribing PLT2 themselves are still capable of dividing (Sup- 
plementary Notes and Extended Data Fig. 5i). Our modelling predicted 
that besides cell-to-cell movement, growth dilution of PLT2 by cell 
division also has a role in the formation of the gradient. To test this, we 
blocked cell division using IAA (Supplementary Notes), hydroxyurea 
(HU)” or by removing the shoot* in pPLT2:CYCB1;1-RFP X pPLT2: 
PLT2-3X YFP double reporter lines. We discovered that while the PLT2 
transcription domain remained essentially unaltered, the domain only 
containing PLT2-3 x YFP protein was reduced (Fig. 3g, h), confirming 
a role for growth dilution in gradient formation. 

In our simulation model, the incorporation of both root growth and 
PLT intercellular movement with realistic parameter values was neces- 
sary to generate a smooth PLT gradient capable of dosage-dependent 
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(c) or presence (d) of growth. e, The stability of reporter protein fusion 
determines the expression pattern driven by the PLT2 promoter. Stable H2B- 
YFP extends into the differentiation zone, whereas labile CYCB1;1-YFP is 
confined to the meristem. f, PLT2 transcription is in proximal meristem 
only (red), whereas PLT2 protein (green) resides in the whole meristem. 

g, h, Division inhibition by shoot removal (“No shoot’), HU and IAA 
treatments shorten PLT2-3X YFP gradient. The number of visible YFP-only 
cells between two arrowheads in g are presented in h. Ctrl, control. n = 7 roots 
for all treatments, except 6 roots for IAA. Error bars show standard deviation 
(s.d.). **P = 0.01, two-way ANOVA with Bonferroni correction. Scale bars, 
50 um, except in b, 10 im. 


control of root zonation similar to our experimental observations (Fig. 4a 
and Supplementary Video 2). Interestingly, a similar gradient-forming 
mechanism functions in vertebrate axial patterning. There, polarized 
growth creates a gradient of stable FGF8 messenger RNA, with diffusion- 
mediated spread of the FGF8 protein smoothing and further extending 
the protein gradient™, and FGF8 itself controlling the growth process”. 
This regulatory architecture, in which growth controls gradient for- 
mation and gradient formation controls growth, has been suggested as 
a robust means to coordinate growth and patterning in polar growing 
tissues”, possibly explaining why it evolved independently in both plants 
and animals. 

Previous studies have suggested roles for auxin in cell division, ex- 
pansion and differentiation. However, the role of auxin in these pro- 
cesses could only be indirect, through regulation of PLT levels. To test 
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Figure 4 | Root zonation under normal growth and gravitropism. 

a, Zonation dynamics in the PLT-spread model under normal growth 
conditions. Snapshots of the auxin distribution (Aux), PLT transcription (PLT), 
PLT protein (PLT), division rate (DivR, measured as number of divisions per 
cell per hour), cell expansion rate (ExpR, measured as growth (um) per unit 
tissue (tum) per hour), differentiation level (Diff) and zonation dynamics (Zon) 
profiles. b, Root zonation dynamics in the gravitropism model under dynamic 
gravitropism. Left, snapshots of auxin, PLT transcription, PLT protein, 
expansion rate and zonation for leftward oriented gravity vector. (For 
downward and rightward oriented gravity vector see Extended Data Fig. 9b.) 
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this hypothesis, we next investigated whether there is also a direct role 
for auxin in controlling root zonation dynamics. To focus on direct effects 
of auxin, we considered short timescales insufficient to lead to changes 
in PLT expression. Auxin addition, application of the auxin antagonist 
auxinole”, and inhibition of auxin signalling by inducing the stable 
ARF-signalling repressor axr3-1 (ref. 28) experiments all confirmed 
that auxin rapidly regulates all zonation processes. Cell division and ex- 
pansion rates depended on optimum auxin levels, with different thresh- 
olds, whereas differentiation required a minimum level of auxin (see 
Supplementary Notes, Supplementary Videos 3, 4 and Extended Data 
Figs 6 and 7). Our computational model could readily be extended with 
these auxin-dependent rates (‘auxin model’), reproducing both normal 
zonation and the experiments described earlier (see Supplementary Notes, 
Supplementary Methods and Extended Data Fig. 8). 

Thus, our study uncovered a regulatory architecture in which auxin: 
(1) rapidly influences rates of developmental processes within zones 
without directly affecting PLT levels (minutes to hours timescale); and 
(2) influences the size and location of differentiation zones slowly through 
regulating PLT transcription (timescale of days). A subtle coupling bet- 
ween these processes occurs because auxin influences PLT growth dilu- 
tion through division and expansion rates (timescale of hours) and hence 
the location of the division and expansion thresholds (Extended Data 
Figs 2c, 6a, 8c, Supplementary Notes and Supplementary Methods). 
The coexistence of slow, PLT-mediated and rapid, direct auxin effects 
on zonation made us wonder why such an elaborate control system has 
evolved. To investigate this, we analysed gravitropism, an auxin-mediated 
process operating at a faster timescale than the generation of the PLT 
gradient. Gravity stimuli drive PIN protein reorientation-mediated asym- 
metric auxin accumulation on the lower side of the root within 5 min 
(refs 13, 14), causing inhibition of cell expansion, and bending of the 
root towards the new gravity vector within 6h (refs 13, 14) (Fig. 4c). 
When PIN protein reorientation caused by alternating gravitropic stim- 
uli was simulated in our model (‘gravitropism model’, Fig. 4b), elevated 
auxin levels alternated from left to right in the root and induced the 
differential expansion that drives root bending, while PLT levels stayed 
constant (Fig. 4b, Supplementary Video 5 and Extended Data Fig. 9a—d). 
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Right, dynamics of left-right differences in auxin, differentiation level and PLT 
protein distribution. Depicted are the used columella PIN orientation patterns, 
the applied 12 h cycle of PIN orientation changes, and the resulting auxin, 
differentiation level and PLT left-right distribution differences (see 
Supplementary Methods). t indicates the time point at which the snapshot was 
taken. c, DR5 and PLT expression in the same root (top) after gravitropic 
stimulation resulting in left-right difference in appearance of the first root hair 
(bottom). Arrows with ‘g’, gravity vector; white arrowheads, individual cells in 
the elongation zone; white arrows, youngest root hairs. Scale bar, 50 jum. 


The predicted constant PLT levels were confirmed experimentally 
(Fig. 4c). Thus, this regulatory design allows for a partial separation 
of timescales that enables rapid auxin-mediated tropic responses, essen- 
tial for sessile plants to respond to environmental challenges, while main- 
taining stable PLT-mediated developmental zonation (Extended Data 
Fig. 10a—c and Supplementary Discussion). If, in contrast, as was prev- 
iously thought, PLT expression were a relatively direct and proportion- 
ate readout of auxin levels, both auxin and PLT patterns would fluctuate 
under tropisms, resulting in variable zonation patterns and loss of co- 
ordinated differentiation (Extended Data Fig. 10d, eand Supplementary 
Discussion). 

Weuncover the auxin-PLT networkas a core module on which other 
factors, such as other phytohormones (for a review, see ref. 29), can act 
to regulate growth. Our study prompts two directions for future explo- 
ration. First, recently uncovered positive feedbacks from PLT back to 
auxin biosynthesis and transport”'*”° do not notably affect the beha- 
viour of our model (Extended Data Fig. 9e-g). We speculate that these 
feedbacks may havea role only during the generation of new primordia, 
when robust, localized auxin and PLT maxima need to be established. 
Second, the dominant role of PLT gradients in controlling zonation 
dynamics challenges the role of an auxin gradient as a dose-dependent 
instructive signal. Indeed, recent studies suggest that the auxin profile 
may not be a simple gradient". While our results support a role for 
auxin levels in zonation, they leave undecided whether a specific gra- 
dient-shaped auxin distribution is required. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Extended Data Figure 1 | PLTs are dose-dependent drivers of zonation. 

a, b, The domain of frequent cell division, monitored by cell cycle marker 
CYCB1;1-GFP in Fig. 1b, c, shifts shootward with increased PLT2 dosage (that 
is, homozygote pPLT2:PLT2-YFP in Col background). Histogram in a shows 
the distribution of the CYCB1;1-GFP-positive cells along the meristem at a 
given distance from the quiescent centre. x axis indicates the distance from the 
quiescent centre as number of cortical cells, and y axis label “GFP density’ refers 
to the proportion of CYCB1;1-GFP-containing cells at the given distance from 
the quiescent centre. Shootward shift of the distance of the cell division events 
in the presence of increased PLT2 (green histogram) dosage compared to wild- 
type (red histogram) is significant (t-test for mean P < 0.001, Wilcoxon test for 
median P< 0.001, Kolmogorov-Smirnov for difference of distributions 
P<0.001). b, Histogram presenting rescaled data to show that the distribution 
of the high cell division domain shifted shootward when PLT2 dosage was 
increased. A null hypothesis was that shootward shift is due to higher 
dispersion of the distribution observed under increased PLT2 dosage. To test 
this hypothesis, the control CYCB1;1-GFP data were rescaled to match the 
maximum values of PLT2 data. The null hypothesis was rejected (t-test 


P= 0.001, Wilcoxon test P = 0.0012, Kolmogorov-Smirnov P = 0.0026), 
indicating that the shootward shift of the high division domain in the presence 
of increased PLT2 dosage is significant, and not due to dispersion. The bin 
width in histograms is two cells (that is, 1st bar, 1 and 2 cells; 2nd, 3 and 4, and 
so on). ¢, Induction of pAHP6:X VE>PLT2-YFP inhibits xylem differentiation 
(left) (white arrow indicates the first protoxylem element) and triggers ectopic 
cell divisions illustrated by CYCB1;1-GUS activity (right) (arrowheads), 
whereas root hairs develop normally (yellow arrows). Insets show 
magnifications from the designated areas. d, PLT2-YFP induction (t = 0) 
rapidly inhibits root growth (top) and cell expansion (bottom) in three roots. 
Expansion rates as shown as averages + s.d. e, Inhibition of growth coincides 
with appearance of PLT2-YFP signal after induction. f, Induction of PLT2 
RNAi (in p/t1,3,4; pPLT2:PLT2-YFP) abolishes the PLT2-YFP signal by 49h 
and consequently promotes expansion and differentiation of the meristem 
cells, as indicated by the appearance of expanded cells as well as protoxylem 
(arrow) and root hairs (arrowhead) in the meristem. Images from the same root 
using identical confocal microscopy settings for the yellow channel. 

Scale bars, 50 um. 
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IAA+NPA 


Extended Data Figure 2 | PLT expression patterns respond only to long- 
term auxin accumulation in the meristem. a, PLT expression shifts 
shootward only when prolonged auxin application is accompanied with polar 
auxin transport inhibitor (NPA) treatment. Four-day-old seedlings were 
transferred to an agar plate containing 20 1M NPA plus 5 uM IAA for the time 
periods indicated in the images. The expression of pPLT1:PLT1-YFP, 
pPLT3:PLT3-YFP and pPLT4:PLT4-YFP spreads shootward (white arrows) by 
72h of NPA plus IAA treatment. b, PLT expression patterns are insensitive for 
auxin-only treatments. Four-day-old seedlings were transferred to an agar 
plate containing 5 uM IAA for the time periods indicated in the images. The 
auxin response reporter DR5:erGFP rapidly responded to the treatment 
whereas the expression domains of pPLT1:PLT1-YFP, pPLT2:PLT2-YFP, 
pPLT3:PLT3-YFP and pPLT4:PLT4-YFP failed to expand. Black arrow 
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indicates the region in meristem that is absent of DR5:erGFP fluorescence after 
IAA treatment but is filled with fluorescence after NPA plus IAA treatment 
(Fig. 2a). Observed phenotypes/number of roots analysed are indicated in the 
right bottom corners. c, Twenty-four hours of NPA plus IAA treatment fails to 
expand the PLT2-YFP gradient shootward. In fact, the treatment leads to 
transient shortening of the PLT2-YFP gradient, probably due to inhibition of 
growth dilution of PLT2-YFP in the meristematic cells (see Fig. 3g, h). 
P<0.001, Kolmogorov-Smirnov test; error bars indicate 95% confidence 
intervals. n = 20 (dimethylsulphoxide (DMSO)) and 23 (NPA plus IAA). 

d, Graded pPLT2:PLT2-YFP expression despite shallow auxin gradient in 
aux1,ein2,gnom; representative image from three independent lines. 

Scale bars, 50 um. 
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Extended Data Figure 3 | Composition of the zonation model. a, Overview 
of the initial, PLT-spread, auxin and closed feedback models. Relationships 
between model variables (grid-based auxin levels, cell-based PIN and PLT 
levels), growth processes (stem cell maintenance, division, expansion and 
differentiation), PIN-mediated auxin transport, cell-to-cell PLT movement and 
their emerging consequences for auxin and PLT levels are shown, with different 
colours indicating from which model onward they are incorporated (models 
are in order of increasing complexity). Variables are indicated in rectangles, 
processes are indicated in ovals. A distinction is made between cell-level and 
tissue-level processes. Direct regulatory interactions are indicated with arrows, 
and a distinction is made between fast to intermediate speed versus slow 
interactions, and between the levels of the ‘input’ variable needed for a 
particular process to occur. Emergent feedbacks, with processes influencing 
variable levels other than through a direct regulatory effect (for example, 
growth spreading PLT and hence influencing its levels) are indicated as dashed 
arrows. b, Spatially explicit overview of the interplay between the variables and 
processes and how these generate the auxin and PLT gradients and control 
root zonation dynamics in the auxin model. Stem cells and slow division 
(STEM), fast division (DIV), expansion (EXP) and differentiation (DIF) 
processes are indicated in black if the local auxin and PLT levels permit these 
processes, and in grey if the local auxin and PLT levels prevent these processes 
from occurring. Similarly, black arrows indicate that local auxin or PLT levels 
cause promotion or repression of a process, grey arrows indicate that auxin 
or PLT in principle has a particular effect on a process but that local levels do 
not allow for this effect to occur. Blue arrows indicate transcriptional effects. 
The broad dark blue arrow indicates PIN-mediated auxin transport. c, Layout 
of the root tissue and PIN distribution pattern used in the simulations. The 
simulated tissue contains columella cells (cyan), quiescent centre cells (grey), 
epidermal cells (blue), cortical cells (green), border cells (yellow) and 
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vasculature cells (red). For the PIN distribution pattern, four levels are 
distinguished: no PINs, a maximum relative level of 0.1, 0.35, and 1. These 
maximum relative levels are multiplied with the cellular PIN protein level to 
determine a membrane segment’s actual PIN level. The rootward-shootward 
flip of PIN polarity in cortex cells is indicated with arrows. d, Time course 

of free ARF levels, and protein levels of transcription factors A, B, C, D and PLT 
under a constant auxin application (level 100). Left, fast PLT protein turnover; 
right, slow PLT protein turnover. e, Schematic overview of root zonation 

and its dependence on PLT and auxin gradients. Top panel, location of the 
meristem (MZ), expansion (EZ) and differentiation (DZ) zones. Second panel, 
PLT concentration profile along the length of the root. PLT levels dictate 

the location of stem cell maintenance and slow division, fast division and 
expansion and differentiation domains, with the first threshold demarcating 
the boundary between stem cell and fast division domains and the second 
threshold demarcating the boundary between fast division and expansion and 
differentiation domains. Third panel, auxin profile along the root. Fourth panel, 
dependence of division, expansion and differentiation rates on auxin levels. 
A,B,C and D correspond to different auxin levels; for comparison purposes it is 
shown where these levels occur both in the auxin gradient profile (third panel) 
and where these levels occur in the division, expansion and differentiation 
auxin rate dependency functions (fourth panel). f, Pseudocode of the algorithm 
used in the program to determine whether cells will grow, divide, expand or 
differentiate. In addition, a cartoon version of the consequences of growth, 
division, expansion and differentiation processes on gene expression levels, 
differentiation levels, cell size and PIN pattern is shown. g, Schematic depiction 
of the neighbouring cells and weighting factors used to calculate weighted, 
locally averaged growth and expansion rates. B, boundary cells; C, cortex cells; 
E, epidermal cells; V, vascular cells. 
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Extended Data Figure 4 | The requirement of high auxin levels produces 
narrow, non-graded PLT profiles. a, Vascular auxin concentration profiles 
under root zonation dynamics in the initial model for different half-saturation 
values for free ARF. Note: the curves are practically superimposed. A snapshot 
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image of the zonation below the graph illustrates the location of the root zones. 
Columella (C), meristem (MZ), expansion (EZ) and differentiation (DZ) zones 
are shown. b, Vascular PLT transcription (continuous lines) and protein 
profiles (dashed lines) for the same parameter settings as in a. 
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Extended Data Figure 5 | PLT2 protein persistence and mobility maintain 
meristematic characteristics without elevating auxin response. a, A single 
cell clone expressing PLT2-RFP exiting the meristem (0 min) and travelling 
through the expansion zone towards the differentiation zone (200 min and 
340 min). b, Magnification of the marked area in a (0 min), demonstrating that 
the clone (marked with green fluorescence, appearing as yellow when 
overlapped with PLT2-RFP red fluorescence) consists originally of a single cell 
(white arrowheads). Note that PLT2-RFP (red nuclear fluorescence) is present 
both in the clone and the surrounding cells. Inset shows optical cross-section at 
the position of the clone. c, Magnification of the marked area in a (340 min) 
showing that the clone has divided once while being in the expansion zone 
(arrowheads mark two clonal cells), and that PLT2-RFP-expressing cells do not 
expand, whereas cells produced before and after generation of the clone 

have expanded. d, Auxin response sensor DR5:nYFP (yellow nuclear 
fluorescence)*” is not elevated in the PLT2-RFP cells (white arrowheads) but 
shows normal response in vasculature (white arrows). e, Anti-auxin, 


31. Heisler, M. G. et al. Patterns of auxin transport and gene expression during 
primordium development revealed by live imaging of the Arabidopsis 
inflorescence meristem. Curr. Biol. 15, 1899-1911 (2005). 
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a-(phenylethyl-2-oxo)-IAA (PEO-IAA) inhibits root hair formation, but fails 
to promote cell expansion in the PLT2-RFP clones. A PLT2-RFP clone exiting 
the meristem (0h) and travelling through the elongation zone towards the 
differentiation zone. PEO-IAA (30 LM) was applied to the medium 2h after 
taking the first (0h) image. Then images were taken 2.5h, 4h, 6h and 20h 
after PEO-IAA application. Note: root hair production is inhibited after 
PEO-IAA application. Inset, magnification of the marked area in the 0 h image, 
showing the clone (marked with green fluorescence) and that PLT2-RFP (red 
nuclear fluorescence) is present both in the clone and the surrounding cells. 
f, PLT2-3X YFP shows reduced expression in the stele. g, h, The movement- 
deficient version, PLT2-3X YFP, complements the p/t1,2 mutant, although the 
meristem is shorter than when PLT2-YFP is used. Seedlings were 7 days old; 
arrowheads, MZ boundary. Asterisks in h, Wilcoxon test (P < 0.001); meristem 
size of PLT2-3X YFP lines significantly reduced. Error bars show s.d. i, Cells 
shootward from the stem cell niche are proliferating without PLT2 
transcription (arrowheads, cells with GFP but no RFP). 
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Extended Data Figure 6 | High auxin levels rapidly inhibit cell division and 
expansion, but not differentiation. a, First signs of expansion zone 
differentiation 7 h after 5 uM IAA application marked by appearance of 
protoxylem elements (arrow). By 24h ubiquitous differentiation of the 
expansion zone is evident. 0 h image used from Fig. la. Scale bar, 50 um. 

b, Auxin application rapidly inhibits growth while xylem differentiation, 
monitored by green fluorescence of $18 marker*’, proceeds towards meristem. 
Snapshots from a video of the same root before and after application. Right 
panel, S18 signal is tightly associated with protoxylem differentiation (arrow). 
c, Root growth (um h ~ ‘) and mitosis (below the x axis) of two roots over time 
(min). [AA applied at t = 0. d, e, Application of 5 uM IAA and inhibition of 
auxin signalling by 30 1M PEO-IAA leads to decreased accumulation of 
5-ethynyl-2’-deoxyuridine (EdU) stain (red fluorescence), marking DNA 
replication. Asterisks in e, Mann-Whitney U test P< 0.05, after Bonferroni 
correction of multiple comparisons; reduction of number of EdU-stained 
nuclei compared with DMSO control. Error bars show s.d. f, Application of 
moderate levels of IAA (30 nM) still inhibited cell expansion (Supplementary 
Notes) but did not inhibit cell division. Root growth (um h ') and mitotic 


32. Lee, J. Y. etal. Transcriptional and posttranscriptional regulation of transcription 
factor expression in Arabidopsis roots. Proc. Nat! Acad. Sci. USA 103, 6055-6060 
(2006). 
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events (below the x axis) of two roots over time (min). IAA (30nM) was 
applied at t = 0. g, h, To measure the duration of the differentiation process, 
individual cells were followed as they left the meristem, expanded and entered 
the differentiation zone. g, Tracking of a GFP clone” consisting of four cells. 
Arrows highlight a cell just entering the expansion zone in the first panel 
and in the last panel the same cell entering the differentiation zone. For this 
particular cell it took approximately 6 h 45 min to travel through the expansion 
zone. Six clones located in six roots were followed through the expansion zone, 
and it took 6-8 h for these clones to travel through the expansion zone. 

h, Snapshots from a video recording the growth of wild-type root in the 
presence of 30nM IAA. The cells entering the expansion zone (arrows in left 
panel) were traced in the video to record the time it takes to enter the 
differentiation zone (arrows in the right panel). For the marked cell it took 
approximately 7h 10 min to travel through the expansion zone. Tracking of 
cells through the expansion zone was carried out for nine cells located in 
three different roots, and it took 6-8 h for these cells to travel through the 
expansion zone. 
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Extended Data Figure 7 | Auxin is required for cell division, expansion and 
differentiation. a, Protoxylem differentiation is inhibited in the axr3-1 
mutant. First protoxylem is highlighted with an arrow. Higher up in the root 
protoxylem differentiation is often sporadic (inset, arrows indicate a stretch 
of a single protoxylem element). b, Twenty-four hour induction of 
pG1090:XVE>>axr3-1-RFP with 17B-oestradiol (est) inhibits xylem 
differentiation and root hair outgrowth. Left inset, a single protoxylem vessel 
highlighted with two arrows. Right inset, the arrow highlights the beginning ofa 
more continuous protoxylem strand, which was probably already present 
before the induction of axr3-1-RFP. Yellow bars in b, c show the areas in the 
root in which visible protoxylem are present. Yellow arrow in b, c marks the 
first root hair. Est, 5 tM 17B-oestradiol. c, Twenty-hour treatment of 4-day old 
wild-type root (Col) with 25 uM auxinole (auxin antagonist) inhibits xylem 
differentiation, cell expansion and root hair outgrowth. Xylem is typically 
differentiated as short, sporadic stretches. Left inset, arrow shows the end of a 
stretch of a xylem strand comprising approximately six protoxylem elements; 
right inset, higher up in the root two continuous protoxylem strands appear 
(arrows). These strands were probably already present before auxinole 
application. The middle inset shows the presence of short cells (marked with 
arrowheads) high up in the root, indicating that cell expansion was defective. 
The cell length typically varied along the root. d, Bar plot shows that the final 
length of cortex cells is shorter when auxin signalling is inhibited by auxinole 
(n = 35 cells in 6 roots) or inducible pG1090:X VE>>axr3-1-RFP (n = 47 cells in 


33. Cho, H. T. & Cosgrove, D. J. Regulation of root hair initiation and expansin gene 
expression in Arabidopsis. Plant Cell 14, 3237-3253 (2002). 
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6 roots) when compared with differentiation zone cells in the control roots 

(n = 51 cells in 7 roots and n = 38 cells in 7 roots, respectively) located at a 
similar distance from the root tip. Asterisks, Mann-Whitney U test (P < 0.001). 
Error bars show s.d. e-j, The consequence of auxin application or inhibition 
of auxin signalling on marker gene expression. e, f, Auxin application (5 1M 
IAA) rapidly leads to expression of the root hair differentiation marker 
pEXP7:GUS (ref. 33) (e) and the xylem differentiation marker S18 (ref. 32) (f) in 
the elongation zone. Note: high auxin levels (such as 5 uM IAA) are inhibitory 
for root hair elongation. Therefore, even though the root hair marker 
pEXP7:GUS rapidly spreads into the elongation zone, root hair elongation 

is less pronounced there. e-g, Twenty-four hour induction of 
pG1090:XVE>>axr3-1-RFP (est) inhibits the expression of root hair 
differentiation marker pEXP7:GUS (e) and the xylem differentiation marker 
S18 (ref. 32) (f), as well as cell division marker CYCB1;1-GUS” (g). Note: both 
the signal intensity and the number of CYCB1;1-GUS-expressing cells are 
decreased in g. Twenty-four hour induction of XVE>>axr3-1 leads either to 
disappearance of S18 fluorescence (no expression in f), or $18 was present in 
short fragments (fragmented). h, i, Expression of PLT2-YFP continued to mark 
the shortened meristem after auxin signalling was inhibited by induction of 
pG1090:X VE>>axr3-1-RFP (est) (h) or by treatment of the seedlings with 

25 UM auxinole (axnl) (i). j, Auxinole rapidly inhibits cell division as indicated 
by reduction of the number of CYCB1;1-GFP-expressing cells after auxinole 
treatment. 
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Extended Data Figure 8 | Simulation of zonation dynamics in the auxin 
model under normal conditions and conditions of perturbed auxin 
(signalling). a, Zonation dynamics under normal growth conditions. 

b, Zonation dynamics after 24h of intermediate level auxin application. 

c, Zonation dynamics after 24h of high level auxin application. d, Zonation 


PLT 


Zon Aux PLT PLT Zon Aux PLY PLT Zon 


dynamics 60h after shoot cut. e, Zonation dynamics after 24h of inhibited 
auxin signalling. Shown are snapshots of auxin (Aux), PLT transcription (PLT), 
PLT protein (PLT) and zonation (Zon) profiles. In addition, for the normal 
growth conditions, snapshots of division rates (DivR), expansion rates (ExpR) 
and differentiation levels (Diff) are shown. 


©2014 Macmillan Publishers Limited. All rights reserved 


a columella PIN orientation dynamics 


leftward PIN3 apolar PIN3 — rightward PIN3 


t=3.5h 
era COO oo 
iddl : : 
i - ke 9 24 30) 36 | high [fj intermediate [J low membrane PIN level 
ng aoe (h) 
t=9.5h 
t=3.5h t=5.5h t=9.5h 
| 
y 
| 
Lt 
i | ae 
? Tree] PEPER? 
Aux PLT PLT ExpR Zon Aux PLT PLT ExpR Zon Aux PLT PLT ExpR Zon 


SES 


min level max 
PINtot, PINmem, SE and DE: 0- 


© normal root zonation dynamics 


LETTER 


c t=3.5h t=5.5h t=9.5h 
id 
Le 
fs =| 
Pie FAR aa HAA A 
Aux PLT PLT ExpR Zon PLT ExpR Zon Aux PLT PLT ExpR Zon 
d 5 400 =e , 
Q — Diff 
S$ 200 
ro) 
2 0) 
& -200) 
% -400 
0 6 12 18 24 30 
Time (h) 
f after 24 h of high 9 under gravitropic 
a auxin application stimulation 


PINtot 
PINmem 


SE DE 


Aux PLT PLT DivR ExpR Diff Zon 
Extended Data Figure 9 | Simulation of zonation under a dynamic 
gravistimulus protocol and in the closed feedback model. a, Left, 12 h period 
in which leftward, apolar and rightward columella PIN orientations are 
interchanged to simulate dynamic gravitropism. Right, schematic depiction of 
the used leftward, apolar and rightward columella PIN orientations. b, Root 
zonation dynamics for the gravitropism model. Snapshots of auxin, PLT 
transcription, PLT protein, expansion rate and resulting zonation dynamics are 
shown for t = 3.5h when PIN orientation is leftward (left), at t= 5.5h when 
PIN orientation is apolar (middle) and at t = 9.5h when PIN orientation is 
rightward (right). c, Root zonation dynamics for the simplified gravitropism 
model. In the simplified gravitropism model, cellular division and 
differentiation rates are again constant (as in the minimal model) rather than 
ARF level dependent (as in the auxin and normal gravitropism models). Only 
expansion rates are ARF level dependent, such that they decrease from their 
maximum value for higher than optimal ARF levels. Similar snapshots as in 
b are shown. d, Dynamics of left-right differences in auxin, differentiation 
level and PLT protein distribution in the simplified gravitropism model. 
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e-g, Simulations with positive feedbacks from PLT back to auxin biosynthesis 
and transport. e, Zonation dynamics under standard growth conditions. In 
addition to the panels shown for other model versions, gene expression patterns 
of the genes dependent on PLT levels are shown. PINtot refers to total cellular 
PIN levels, PINmem to membrane PIN levels, SE to a general auxin 
synthesizing enzyme and DE to a general auxin degrading enzyme. Note that 
membrane PIN levels are a product of cellular PIN protein levels and the 
superimposed cell type and zone dependent membrane PIN pattern (which 
determines the locations and ratios of PINs deposited on the different 
membrane faces of the cell). f, Zonation dynamics after 24h of high auxin 
application. g, Zonation dynamics under dynamic gravitropic stimulation. For 
comparison purposes arrows indicating the location of the transitions from MZ 
to EZ (bottom arrow) and from EZ to DZ (top arrow) as found in the PLT- 
spread model are shown. For the gravitropism simulation (g) the EZ to DZ 
transitions at both the lower (middle arrow) and upper (top arrow) side of the 
root are shown. 
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Extended Data Figure 10 | Structure and function of the auxin- 
PLETHORA regulatory architecture. a, Regulatory architecture controlling 
root zonation dynamics and tropisms. Slow induction of the PLTs by auxin 
(black arrows) defines the pathway that operates through regulating PLT levels 
(green arrows). Parallel to this, auxin can also control zonation rapidly without 
directly affecting PLT levels (blue arrows). b, Schematic overview of root 
developmental zones where local concentrations of auxin, PLT transcript and 
PLT proteins are represented by symbol density. c, Overview of the auxin, PLT 
transcript and PLT protein profiles and corresponding zonation dynamics 
under the following conditions: wild-type (WT), extra PLT2 copy in wild type 
(PLT2:YFP), clonal ectopic expression of PLT in the expansion zone (clone), 
short-term auxin addition (+ aux) or inhibition of auxin signalling (—aux). 
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d, e, Auxin, PLT, zonation, and expansion rate profiles during (d) and after (e) a 
gravitropic stimulus for a simulation in which PLT levels are a direct readout of 
auxin levels, and in which partly differentiated cells dedifferentiate upon re- 
entering the meristem. Differentiation snapshots are shown with 2 h intervals 
during and after the gravitropic stimulus. Brackets highlight the developmental 
progression of the cells that dedifferentiated under the gravitropic stimulus. 
Differentiation graphs show differentiation levels in the leftmost epidermal row 
of cells (corresponding to the side of the root towards the gravity vector) in the 
PLT as direct auxin readout model (black) compared to that of the model 
developed in this study (red), for 2 h after the (end of the) gravitropic stimulus. 
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Cessation of CCL2 inhibition accelerates breast 
cancer metastasis by promoting angiogenesis 


Laura Bonapace” 
& Mohamed Bentires-Alj'* 


Secretion of C-C chemokine ligand 2 (CCL2) by mammary tumours 
recruits CCR2-expressing inflammatory monocytes to primary tu- 
mours and metastatic sites, and CCL2 neutralization in mice inhi- 
bits metastasis’ by retaining monocytes in the bone marrow. Here we 
report a paradoxical effect of CCL2 in four syngeneic mouse models 
of metastatic breast cancer. Surprisingly, interruption of CCL2 inhibi- 
tion leads to an overshoot of metastases and accelerates death. This is 
the result of monocyte release from the bone marrow and enhance- 
ment of cancer cell mobilization from the primary tumour, as well as 
blood vessel formation and increased proliferation of metastatic cells 
in the lungs in an interleukin (IL)-6- and vascular endothelial growth 
factor (VEGF)-A-dependent manner. Notably, inhibition of CCL2 
and IL-6 markedly reduced metastases and increased survival of the 
animals. CCL2 has been implicated in various neoplasias and adopted 
as a therapeutic target’ *. However, our results call for caution when 
considering anti-CCL2 agents as monotherapy in metastatic disease 
and highlight the tumour microenvironment as a critical determin- 
ant of successful anti-metastatic therapy. 

Most breast-cancer-related deaths are caused by metastases in vital 
organs. The tumour microenvironment is key for cancer growth, dis- 
semination and metastasis*®. A high number of intratumoral macro- 
phages correlates with poor prognosis’”-'* and macrophage infiltration 
in breast cancer correlates with high expression of the monocyte 
chemoattractant CCL2. CCL2 has been proposed as a target for metas- 
tatic breast cancer because high expression of CCL2 correlates with a 
decrease in survival of breast cancer patients'*’"'*"* (Extended Data 
Fig. 1a, b), and because monocytes expressing the CCL2 receptor enhance 
metastasis via VEGF secretion in mice”. 

Weassessed the effect of CCL2 neutralization on tumour growth and 
metastasis in syngeneic mouse models of CCL2-secreting metastatic 
breast cancer (Extended Data Fig. 1c-e). Although anti-CCL2 treatment 
had no effect on primary tumour growth, it reduced the number of lung 
metastases (Extended Data Fig. 2a—c) and circulating tumour cells (CTCs) 
(Extended Data Fig. 2d). Intravital imaging'® showed that anti-CCL2 
treatment reduced cancer cell motility and blood vessel leakiness in the 
tumour, and we found more pericytes around blood vessels (Extended 
Data Fig. 2e—g and Supplementary Videos 1, 2). Decreased blood vessel 
leakiness upon anti-CCL2 treatment correlated with fewer CTCs and 
reduced intratumoral macrophage numbers (Extended Data Fig. 3a, b). 
Therefore, CCL2 neutralization limits metastases not only through effects 
on pre-metastatic niches’ but also by limiting cancer cell intravasation. 

Next we examined the persistence of the anti-metastatic effect of CCL2 
neutralization after treatment was discontinued. The antibody was cleared 
within 10 days after treatment (Extended Data Fig. 3c), leading to a re- 
bound of CCL2 in the lungs (Extended Data Fig. le). Surprisingly, we 
found that cessation of anti-CCL2 treatment accelerated death (Fig. 1a). 
Ten days after interruption of anti-CCL2 treatment, we found a dra- 
matic increase in lung and liver metastases and an increase in CTC 
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numbers (Fig. 1b-d and Extended Data Fig. 4a, b). Thus, although anti- 
CCL2 treatment reduced metastases, interruption of the treatment aggra- 
vated metastasis when compared to controls. However, the anti-metastatic 
effect persisted when animals were treated continuously with anti-CCL2 
antibody (Extended Data Fig. 4c, d). 

Since Ly6C-positive monocytes respond to CCL2, we assessed this 
paradoxical effect of CCL2 by focusing on the chemoattraction between 
breast cancer cells and SSC'°“CD11b *Ly6C* monocytes. We found that 
CCL2 drives mutual attraction of monocytes and tumour cells in vitro 
(Extended Data Fig. 5a—c). Monocytes were sequestered in the bone 
marrow during anti-CCL2 treatment (Extended Data Fig. 6a, b, left). 
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Figure 1 | Discontinuation of anti-CCL2 treatment increases lung 
metastases and accelerates death of mice. a, Kaplan-Meier survival curves of 
mice treated with anti-CCL2 antibody or control IgG for 14 days. The primary 
tumour was removed on the last day of treatment and the animals were left 
untreated until first signs of distress. n = 12 mice per group (pooled data of 
two experiments); black box, one animal was censored. **P = 0.0001, log-rank 
test. b, Timeline of the in vivo experiments. c, Metastatic index in tumour- 
bearing mice on day 24. d, Number of colonies formed by CTCs per ml 
blood on day 24. e, Ratio of SSC’'“CD11b*Ly6C* monocytes in organs on day 
24 and on the last day of treatment (day 14). BM, bone marrow. f, Lung 
metastases were quantified as percentage of metastatic area per lung area (left) 
or as area per lung section (right) on five random sections per animal. 

g, Number of colonies formed by CTCs per ml blood. c-g, Data are shown as 
means + standard error of the mean (s.e.m.) of n = 8 mice per group, pooled 
data from two experiments. *P < 0.05, unpaired t-test. NS, not significant. 
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Ten days after treatment, monocyte numbers decreased in the bone 
marrow of animals treated with anti-CCL2 but increased in the primary 
tumour and lungs (Fig. le and Extended Data Fig. 6a-c, right). There- 
fore, monocytes were retained in the bone marrow during anti-CCL2 
treatment and were released after treatment. 

To assess monocyte distribution in tumours and metastases during 
and after CCL2 neutralization, we transferred sorted monocytes from 
bone marrow of 4T1.2-tumour-bearing mice into syngeneic tumour- 
bearing mice that had been treated for 7 days with anti-CCL2 or control 
immunoglobulin (Ig)G. During anti-CCL2 treatment, the transferred 
monocytes were retained in the bloodstream and their homing to the 
primary tumour and to the metastatic site was inhibited (Extended Data 
Fig. 7a). By contrast, 7 days after anti-CCL2 treatment cessation, trans- 
ferred monocytes migrated to the lungs (Extended Data Fig. 7b). 

To assess whether migration of monocytes into the primary tumour 
and lungs drives metastasis, tumour-bearing animals were transfused 
with monocytes from syngeneic tumour-bearing donors for 4 consecutive 
days. This enhanced lung metastases and numbers of CTCs (Fig. 1f, g) 
compared with controls. The data show that monocytes homed from 
the blood to the primary tumour or to the lungs in a CCL2-dependent 
manner and thus increased CTCs and lung metastases. 

We then asked whether monocytes mediate the metastatic overshoot 
after cessation of anti-CCL2 treatment via enhanced mobilization of 
cancer cells from the primary tumour or through direct promotion of 
metastatic growth in the lungs. We removed the primary tumour on 
the last day of anti-CCL2 treatment and assessed lung metastases 10 days 
later. Lung metastases were still increased even though CTCs were 
reduced (Extended Data Fig. 8a, b). Furthermore, the number of mono- 
cytes/macrophages within lung metastases roughly doubled but increased 
to a lesser extent in the metastases of controls (Extended Data Fig. 8c). 
This suggested that the growth-enhancing effect of monocytes in the 
lungs crucially contributed to metastatic overshoot. Indeed, monocytes 
augmented the proliferation of breast cancer cells in vitro (Extended Data 
Fig. 8d). Since this direct effect was only moderate, we surmised that 
monocytes may also affect metastatic growth through effects on the 
metastatic microenvironment. We detected increased intrametastatic 
vasculature and Ki67~ proliferating cells in metastases after anti-CCL2 
discontinuation (Fig. 2a, b). Therefore, the enhancement of metastasis 
by monocytes could be due to a growth-enhancing effect of monocytes 
on tumour cells and to increased vascularization of metastases. To fur- 
ther test this hypothesis, we treated mice with anti-CCL2 or IgG for 
14 days, followed by colony stimulating factor 1 receptor (CSF1R) inhib- 
itor or vehicle (Fig. 2c). Since treatment with CSF1R inhibitor depletes 
CSF1R-positive cells'” (Extended Data Fig. 8e), we concluded that mono- 
cyte depletion after anti-CCL2 treatment prevented metastatic over- 
shoot (Fig. 2d). Vascular density and proliferation of metastases were 
also reduced (Fig. 2e, fand Extended Data Fig. 8e). High CCL2 in human 
breast cancer biopsies also correlated with high vascularization (Extended 
Data Fig. 9a) and with a high frequency of CD68" macrophages (Extended 
Data Fig. 1a). We conclude that an enhanced number of monocytes/ 
macrophages in the lung after anti-CCL2 treatment cessation supports 
local vascular growth and proliferation of metastases. 

To delineate the molecular mechanism by which monocytes increase 
vascular density within lung metastases after anti-CCL2 treatment, we 
analysed cytokines in supernatants of mono- and co-cultures of cancer 
cells and bone marrow monocytes. Among the cytokines that were in- 
creased in tumour-monocyte co-cultures compared with monocultures 
(Extended Data Fig. 9b, c), IL-6 was the only one known to stimulate 
angiogenesis'*’. Although IL-6 was not elevated in the primary tumour 
after anti-CCL2, its expression in serum and metastatic lungs was higher 
than in controls (Extended Data Fig. 9d and Fig. 3a). In addition, IL-6 
induced expression of pro-angiogenic VEGF-A in monocytes ex vivo. 
This effect appeared to be CCL2 dependent (Fig. 3b). Consistently, ana- 
lysis of gene expression data sets for metastatic breast cancer revealed 
that concomitant high expression of IL-6 in tumours with high CCL2 
expression was associated with diminished patient survival, compared 
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Figure 2 | Monocytes induce blood vessel formation and proliferation of 
lung metastases after discontinuation of anti-CCL2 treatment. a, Left, 
representative images of CD31-stained lung metastases on day 24. Scale bar, 
50 um. Right, quantification of CD31 staining. b, Left, representative images 
of Ki67-stained lung metastases on day 24. Scale bar, 100 jum. Right, 
quantification of Ki67 staining. a, b, Graphs show means + s.e.m. of 20 fields of 
view on 5 lung sections per animal, n = 4 mice per group, pooled data from two 
experiments. **P < 0.001, *P < 0.05, unpaired t-test. One representative image 
out of 20 is shown per group. c, Timeline of in vivo experiments. CSF1Rinh, 
CSFIR inhibitor. d, Lung metastases were quantified as percentage of 
metastatic area per lung area (left) or as area per lung section (right). vhc, 
vehicle. e, Quantification of CD31 staining. f, Quantification of Ki67 staining. 
d-f, Graphs show means = s.e.m. of 10 fields of view on 5 lung sections per 
animal, n = 4 mice per group, pooled data from two experiments. *P < 0.05, 
analysis of variance (ANOVA) with post-hoc Bonferroni correction. NS, 

not significant. 


with concomitant low expression of IL-6 (Extended Data Fig. 9e). We 
conclude that high expression of IL-6 in the lungs subsequent to the 
cessation of anti-CCL2 treatment induces VEGF-A in monocytes and, 
at the same time, increases local vascular density. 

Since IL-6 appeared to drive blood vessel density in metastatic lungs, 
we treated mice with anti-CCL2 followed by anti-IL-6. We observed 
reduced lung metastases and CTCs and increased survival of the mice 
(Fig. 3c, d and Extended Data Fig. 9f). We then asked whether anti-IL-6 
decreased angiogenesis and tumour cell proliferation within metastases. 
Treatment of tumour-bearing animals with anti-IL-6 after anti-CCL2 
reduced the overshooting number of CD31" cells in lungs and VEGF-A 
levels in lungs and serum (Fig. 3e, f and Extended Data Fig. 9g). Con- 
sistently, the proliferation of metastatic tumour cells declined to unde- 
tectable levels (Fig. 3g). Next we sought to understand how IL-6 induces 
VEGF-A expression in vivo. STAT3 phosphorylation (p-STAT3) results 
in VEGF-A secretion downstream of IL-6 signalling”**'. Therefore we 
assessed the level of p-STATS3 in lung metastatic tissue. Consistent with 
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Figure 3 | Anti-IL-6 treatment after cessation of anti-CCL2 therapy 
prevents the overshoot of lung metastases. a, IL-6 expression in lung 
homogenates on day 14 and 10 days after anti-CCL2 treatment. Data are shown 
as means + s.e.m., n = 4 mice per group, pooled data from two experiments. 
*P < 0.05, **P < 0.001, unpaired t-test. b, Mean Vegfa messenger RNA 
levels in monocytes (Mono) from tumour-bearing mice treated ex vivo with 
IL-6 and/or anti-CCL2. Data show means + s.e.m., n = 3 mice per group, one 
representative of three independent experiments. *P < 0.05, ANOVA with 
post-hoc Bonferroni correction. c, Timeline of in vivo experiments. d, Left, 
quantification of lung metastases. Right, Kaplan-Meier survival curves of 
animals treated with control IgG, anti-CCL2, anti-IL-6 or anti-CCL2 plus 
anti-IL-6 as in Fig. 3c. On day 24, the animals were left without treatment until 
the appearance of signs of distress. n = 10 mice per group. P = 0.0017 by 

the log-rank test comparing anti-CCL2 with anti-CCL2 plus anti-IL-6 
treatment. e, Quantification of CD31 staining. f, VEGF-A expression in lung 
homogenates from mice treated as in Fig. 3c measured by enzyme-linked 
immunosorbent assay (ELISA). Data are shown as means + s.e.m., n = 8 mice 
per group, pooled data from two experiments. *P < 0.05, ANOVA with 
post-hoc Bonferroni correction. g, Quantification of Ki67 staining. d (left), 

e, g, Data show means + s.e.m. of 20 fields of view on 5 sections per animal, 
n= 4 animals per group, pooled data from two experiments. *P < 0.05, 
ANOVA with post-hoc Bonferroni correction. NA, not applicable. 

h, Quantification of p-STAT3 staining. Data are shown as means + s.e.m. of 
10 fields of view on 5 sections per animal, n = 4 mice per group, pooled data 
from two experiments. *P < 0.05, ANOVA with post-hoc Bonferroni correction. 


the increases in IL-6 and VEGF-A in the lungs, p-STAT3 also increased 
upon discontinuation of anti-CCL2 (Fig. 3h). These results suggest that 
IL-6/p-STAT3 pro-angiogenic effects contributed to the metastatic 
overshoot after cessation of anti-CCL2 treatment. 

We neutralized VEGF-A in vivo to assess whether it supported tumour 
cell proliferation after anti-CCL2 or IgG treatment (Fig. 4a). This pre- 
vented metastatic overshoot, reduced blood-vessel density and reduced 
proliferation in lung metastases (Fig. 4b, c). p-STAT3 levels in the lungs 
did not change with this treatment (Extended Data Fig. 10a), suggest- 
ing that VEGF-A acts downstream of p-STAT3. 

To confirm that monocytes/macrophages contributed to the express- 
ion of VEGF-A after cessation of anti-CCL2, we assessed p-STAT3 and 
VEGF-A levels in lungs from animals treated with anti-CCL2 or IgG con- 
trol, followed by macrophage depletion with CSF1R inhibitor. p-STAT3 
and VEGF-A levels increased in lungs of animals after cessation of 
anti-CCL2 treatment, but not if anti-CCL2 treatment was followed by 
monocyte/macrophage depletion (Fig. 4d, e and Extended Data Fig. 10b). 
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Figure 4 | Combined anti-CCL2 and anti-VEGF treatment reduces 
angiogenesis and tumour cell proliferation in lung metastases. a, Timeline 
of the in vivo experiments. b, Quantification of lung metastases as percentage 
of metastatic area per lung area (left) or as area per lung section (right). 

Data are shown as means ~ s.e.m. of 20 fields of view on 5 lungs sections per 
animal, n = 8 mice per group, pooled data from two experiments. *P < 0.05, 
ANOVA with post-hoc Bonferroni correction. c, Top left, representative 
images of CD31-stained lungs on day 24. Top right, quantification of CD31 
staining. Data are shown as means + s.e.m. of 20 fields of view on 5 lung 
sections per animal, n = 4 mice per group, pooled data from two experiments. 
*P < 0.05, ANOVA with post-hoc Bonferroni correction. Scale bar, 100 um. 
Bottom left, representative images of Ki67-stained lung on day 24. Bottom 
right, quantification of Ki67 staining. Data are shown as means + s.e.m. of 
20 fields of view on 5 lung sections per animal, n = 3 mice per group, pooled 
data from two experiments. *P < 0.05, ANOVA with post-hoc Bonferroni 
correction. Scale bar, 100 jum. One representative image out of 20 is shown 
per group. d, Quantification of p-STAT3 staining. Data are shown as 

means = s.e.m. of 10 fields of view on 5 sections per animal, n = 3 mice per 
group, pooled data from two experiments. *P < 0.05, ANOVA with post-hoc 
Bonferroni correction. CSF1Rinh, CSF1R inhibitor; vhc, vehicle. e, VEGF-A 
expression in lung homogenates from mice treated as in Fig. 4a measured by 
ELISA. Data are shown as means = s.e.m., n = 3 mice per group, pooled data 
from two experiments. *P < 0.05, ANOVA with post-hoc Bonferroni 
correction. NS, not significant. 


This suggested that IL-6/p-STAT3-dependent VEGF-A secretion in 
the lung after cessation of CCL2 neutralization is indeed mediated 
by lung macrophages. 

Together, these data support the idea that the complex tumour micro- 
environment is critical for effective anti-tumour strategies. Anti-CCL2 
treatment decreased breast cancer metastases in mice, but interruption 
of anti-CCL2 treatment precipitated an unexpected influx of mono- 
cytes into the metastatic site and overshooting IL-6 levels within the 
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metastatic microenvironment. This led to local enhancement of angio- 
genesis, metastatic disease and a fatal outcome. Therefore our results 
prompt extreme caution when considering anti-CCL2 treatment of 
metastatic breast cancer and other neoplasias and suggest that thera- 
peutic interference with the tumour microenvironment might lead to 
tissue remodelling not only locally but also at remote sites such as the 
bone marrow, with unexpected far-reaching consequences including a 
worsened prognosis for cancer patients. Any tumour immunotherapy 
that only sequesters immune cells away from the tumour and that does 
not permanently reprogram the tissue microenvironment or directly 
kill tumour cells may bear a similar risk of lethal rebound. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 1 December 2013; accepted 15 September 2014. 
Published online 22 October 2014. 


1. Qian, B.Z. etal. CCL2 recruits inflammatory monocytes to facilitate breast-tumour 
metastasis. Nature 475, 222-225 (2011). 

2. Lu,X. & Kang, Y. Chemokine (C-C motif) ligand 2 engages CCR2* stromal cells of 
monocytic origin to promote breast cancer metastasis to lung and bone. J. Biol. 
Chem. 284, 29087-29096 (2009). 

3. Wolf, M. J. et al. Endothelial CCR2 signaling induced by colon carcinoma cells 
enables extravasation via the JAK2-Stat5 and p38MAPK pathway. Cancer Cell 22, 
91-105 (2012). 

4. Pietras, K. & Ostman, A. Hallmarks of cancer: interactions with the tumor stroma. 
Exp. Cell Res. 316, 1324-1331 (2010). 

5. McAllister, S. S. & Weinberg, R. A. Tumor-host interactions: a far-reaching 
relationship. J. Clin. Oncol. 28, 4022-4028 (2010). 

6. Hanahan, D.& Coussens, L. M. Accessories to the crime: functions of cells recruited 
to the tumor microenvironment. Cancer Cel! 21, 309-322 (2012). 

7. Karnoub, A. E. & Weinberg, R. A. Chemokine networks and breast cancer 
metastasis. Breast Dis. 26, 75-85 (2007). 

8. Kleer, C. G., van Golen, K. L. & Merajver, S. D. Molecular biology of breast cancer 
metastasis. Inflammatory breast cancer: clinical syndrome and molecular 
determinants. Breast Cancer Res. 2, 423-429 (2000). 

9. Palangie, T. eta/. Prognostic factors in inflammatory breast cancer and therapeutic 
implications. Eur. J. Cancer 30, 921-927 (1994). 

10. Ueno, T. et al. Significance of macrophage chemoattractant protein-1 in 
macrophage recruitment, angiogenesis, and survival in human breast cancer. Clin. 
Cancer Res. 6, 3282-3289 (2000). 

11. Valkovié, T., Lucin, K., Krstulja, M., Dobi-Babic, R. & Jonjic, N. Expression of 
monocyte chemotactic protein-1 in human invasive ductal breast cancer. Pathol. 
Res. Pract. 194, 335-340 (1998). 


LETTER 


12. Saji, H. et al. Significant correlation of monocyte chemoattractant protein-1 
expression with neovascularization and progression of breast carcinoma. Cancer 
92, 1085-1091 (2001). 

13. Chavey, C. et a/. Oestrogen receptor negative breast cancers exhibit high cytokine 
content. Breast Cancer Res. 9, R15 (2007). 

14. Goede, V., Brogelli, L., Ziche, M. & Augustin, H. G. Induction of inflammatory 
angiogenesis by monocyte chemoattractant protein-1. Int. J. Cancer 82, 765-770 
(1999). 

15. Li, X. etal. A destructive cascade mediated by CCL2 facilitates prostate cancer 
growth in bone. Cancer Res. 69, 1685-1692 (2009). 

16. Bonapace, L. etal. If you don’t look, you won't see: intravital multiphoton imaging of 
primary and metastatic breast cancer. J. Mammary Gland Biol. Neoplasia 17, 
125-129 (2012). 

17. Strachan, D.C. etal. CSF1R inhibition delays cervical and mammary tumor growth 
in murine models by attenuating the turnover of tumor-associated macrophages 
and enhancing infiltration by CD8 T cells. Oncolmmunology 2, e26968 (2013). 

18. Nilsson, M.B., Langley, R. R. & Fidler, |. J. Interleukin-6, secreted by human ovarian 
carcinoma cells, is a potent proangiogenic cytokine. Cancer Res. 65, 
10794-10800 (2005). 

19. Wani, A. A., Jafarnejad, S. M., Zhou, J. & Li, G. Integrin-linked kinase regulates 
melanoma angiogenesis by activating NF-«B/interleukin-6 signaling pathway. 
Oncogene 30, 2778-2788 (2011). 

20. Wei, D. et al. Stat3 activation regulates the expression of vascular endothelial 
growth factor and human pancreatic cancer angiogenesis and metastasis. 
Oncogene 22, 319-329 (2003). 

21. Wei, L.H. etal. Interleukin-6 promotes cervical tumor growth by VEGF-dependent 
angiogenesis via a STAT3 pathway. Oncogene 22, 1517-1527 (2003). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank R. L. Anderson, N. Hynes, and R. Cardiff for cell lines, 
R. Thierry and M. Kirschmann for scripts for immunohistochemistry and two-photon 
imaging analysis, M. Stadler and H.-R. Hotz for bioinformatics, H. Kohler for FACS and 
S. Bichet for technical support, H. Brinkhaus and P. Loetscher for helpful discussions, 
D. Dylan for the CSFR1 inhibitor, J. van Rheenen, J. Stein, J. Rietdorf, T. Oertner and S. 
Bundschuh for helping us set up the multiphoton intravital microscope, and the 
Bentires-Alj group for feedback. Research in the laboratory of M.B.-A. is supported 

by the Novartis Research Foundation, the European Research Council (243211- 
PTPsBDC), the Swiss Cancer League, the Swiss National Foundation, and the Krebsliga 
Beider Basel. M.-M.C. is supported by the FP7 Marie Curie Fellowship. 


Author Contributions L.B., M.-M.C., T.J. and M.B.-A. designed and performed most of 
the experiments and wrote the manuscript. K.D.M. and Z.V. performed human biopsy 
experiments. J.W., M.-M.C. and L.B. performed intravital imaging. All authors prepared 
the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to M.B.-A. (bentires@fmi.ch). 


6 NOVEMBER 2014 | VOL 515 | NATURE | 133 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


METHODS 


Patient cohorts. Samples from paraffin-embedded invasive breast cancer tissues 
(n = 30) were retrieved from the archives of the Institute of Surgical Pathology, 
University Hospital Zurich and the Institute of Pathology, Cantonal Hospital 
Baselland, Liestal, Switzerland. The minimum follow-up time was 2 years. Clinico- 
pathological data for all patients were available. All biopsies were collected for 
previous diagnostic purposes and processed according to standard procedures. 
Two board-certified pathologists (K.D.M., Z.V.) verified the diagnoses of invasive 
breast carcinoma. 

The retrospective study on formalin-fixed, paraffin-embedded human breast 

cancer tissue was approved by the Cantonal Ethical Committee of Zurich (KEK- 
2012-553). Informed consent was not necessary, as the ethical approval covered 
the ethical issues of the retrospective study and the samples were completely anon- 
ymized and de-identified before the study. For immunohistochemical studies, two 
groups of breast cancer patients were examined: Group 1 (n = 14; 12 hormone- 
receptor positive, 2 triple negative) were primary breast cancer patients without 
lymph node or distal metastases; Group 2 patients (n = 16; 14 hormone-receptor 
positive, 2 triple negative) had both lymph node involvement and distant meta- 
stases, except for three patients with lymph node metastases whose distant meta- 
stasis status was unknown at the time of this study. 
Immunohistochemistry. For all cases, three sequential whole-tissue sections were 
stained with the antibodies anti-CD68 (DAKO-Cytomation, clone PG-M1, 1:100 
dilution), anti-CD31 (Novocastra, clone 1A10, 1:100 dilution, Leica Biosystems) and 
anti-CCL2 (R&D Systems, clone 23002, 1:100 dilution). Immunohistochemistry 
on 4-|1m sections was performed using an automated immunohistochemistry Bond 
platform (Leica Biosystems). Detection was carried out with Refine-Red or Refine- 
DAB kits (Bond), including the respective secondary antibodies. The sections were 
visualized with an Olympus BX41 microscope and attached Olympus UC30 cam- 
era using the cell” digital imaging acquisition software (Olympus). 

The number of CD68-positive macrophages per high-power field (HPF) was 
counted automatically. CD68-positive elements were only counted as macrophages 
if a nucleus could be identified in the section and only intratumoral macrophages 
were taken into account. The mean of 20 HPFs was calculated for each case. Similarly, 
the number of CD31-positive vessels per HPF was counted automatically (20 HPFs 
per case). Only areas of positive staining with a minimal diameter of 20 j1m were 
taken into account to exclude isolated CD31” cells. Cytoplasmic CCL2 staining 
within tumour cells was scored manually by a board-certified pathologist (K.D.M.) 
according toa three-tiered system: 0 (negative staining, no detectable staining), 1+ 
(staining in <10% of tumour cells), 2+ (staining in >10% of tumour cells). 

Tumours grown in mice were fixed in 10% neutral buffered formalin (NBF) for 

24h at 4 °C, washed with 70% ethanol, embedded in paraffin, sectioned at 3 jum, and 
stained with haematoxylin and eosin (H&E), anti-Ki67 (clone PA5-19462, 1:400 dilu- 
tion, Thermo Scientific) or anti-CD31 (clone 390, 1:500 dilution, Invitrogen) antibodies. 
The investigator was blinded in all the immunohistochemistry quantifications. 
Analysis of public microarray data. The publicly available processed data sets from 
The Cancer Genome Atlas (TCGA) for breast invasive carcinoma (AgilentG4502A_ 
version 2013-07-12) were downloaded from https://genome-cancer.ucsc.edu/ and 
analysed with R/bioconductor” >. Patients (all clinical subtypes) were split accord- 
ing to their CCL2 and IL-6 expression and the highest tertile were compared to the 
lowest tertile for each gene. The resulting groups were tested for difference in overall 
survival using the survival package in R by fitting a Cox proportional hazard 
regression model. 
Reagents, recombinant proteins and antibodies. Recombinant mouse JE/MCP- 
1 (CCL2) and mouse IL-6 were purchased from Peprotech (Switzerland). Rat anti- 
mouse-IL-6 (clone MAB-406) and rat IgG1 control were from R&D Systems (UK). 
Purified NA/LE hamster anti-mouse MCP-1/CCL2 (clone 2H5) and hamster 
IgG2a control were purchased from BD Bioscience (Switzerland). Goat anti-mouse 
VEGF-A (clone AF493NA) and polyclonal goat IgG (R&D systems) were used for 
cell culture (1 jg ml’) and for in vivo experiments. CSF1R inhibitor was provided 
by D. Dylan”. 

The following anti-mouse antibodies were used for flow cytometry: CD1 1b (anti- 

mouse APC-CD11b, clone M1/70), Ly6C (anti-mouse FITC-Ly6C, clone HK1.4), 
Ly6G (anti-mouse Percp-Cy5.5-Ly6G, clone IA8), CD206 (anti-mouse FITC-CD206, 
clone C068C2), I-A/I-E (anti-mouse Percp-Cy5.5-I-A/I-E, clone M5/114.15.2), 
CD11c (anti-mouse Percp-Cy5.5-CD1 Ic, clone N418), CD86 (anti-mouse Alexa- 
fluor 488 CD86, clone GL-1), all from BD Bioscience, and mCCR2 (anti-mouse FITC- 
CCR2), from R&D Systems. 
Cell culture. Syngeneic mammary cancer 4T1 cells*®, 4T1.2 cells (derived from 
Balb/c mice, from R. L. Anderson), J110 cells (derived from FVB mice, from N. 
Hynes) and Met-1 cells (from FVB mice, from R. Cardiff) were cultured in DMEM 
supplemented with 10% FCS. In co-culture experiments, 10° tumour cells were 
incubated with 10° sorted primary bone marrow monocytes in DMEM 0.5% FCS. 
Cell viability was measured using a Neubauer chamber and trypan blue. 


ELISA and cytokine arrays. CCL2 levels were assessed using the mCCL2 ELISA 
Duo-Set (R&D Systems) for mouse cells and the hCCL2 ELISA Duo-Set (R&D 
Systems) for human cells. IL-6 was measured by mIL-6 ELISA Duo-Set (R&D Systems). 
VEGF-A levels were assessed using the mVEGF-A Duo-set (R&D System). Cytokine 
arrays from R&D Systems were used according to the manufacturer’s protocol. 

Flow cytometry. Cultured adherent cells were detached using trypsin-EDTA, sus- 
pended in growth medium and counted. Tumours, lungs and bone marrow were 
mechanically and enzymatically dissociated (using collagenase II, Sigma). Cells 
were incubated with 2.5 1g antibodies per 10° cells or with 2.5 4g mouse IgG 
isotype control antibody (R&D Systems) per 10° cells for 20 min at 4 °C in the dark 
before washing and analysis. At least 10* cells per sample were analysed with a 
FACScalibur flow cytometer (Becton Dickinson). Monocytes were identified for 
quantification and sorting as ssc“CD1 1b*Ly6C* cells. The reference population 
for gating was SSC“CD11b ", thereby excluding granulocytes, as shown in Extended 
Data Fig. 5a. Absolute monocyte numbers were quantified as previously described’. 
In brief, after organ harvest, single-cell suspensions were obtained from bone mar- 
row, blood, lung, or tumour. Total viable cell numbers were determined using Trypan 
Blue. Absolute monocyte numbers were calculated as number of total viable cells 
multiplied by the percentage of SSC°"CD11b* Ly6C*Gr1* monocytes of all live cells. 
RNA and quantitative real-time PCR. Total RNA was extracted using the Trizol 
reagent according to the manufacturer’s protocol (Ambion RNA, Life Technologies). 
Aliquots of 1 jg of total RNA were transcribed using the SuperScript III First- 
strand synthesis for RT-PCR (Invitrogen). PCR and fluorescence detection were 
performed using the StepOnePlus Sequence Detection System (Applied Biosystems) 
according to the manufacturer’s protocol in a reaction volume of 20 il containing 
1X TaqMan Universal PCR Master Mix (Applied Biosystems) and 30 ng cDNA. 
For quantification of mouse Ccl2, Hprt1, Vegfa and Il6 mRNA, the 1X TaqMan 
Gene Expression Assays Mm_00441242_m1,Mm00446968_m1,Mm01281449_m1 
and Mm00446190_m1 (Applied Biosystems) were used. All measurements were 
performed in duplicates and the arithmetic means of the cycle threshold (Ct) values 
were used for calculations: target gene mean Ct values were normalized to the 
respective housekeeping gene (Hrpt1), mean Ct values (internal reference gene, 
Ct), and then to the experimental control. The values obtained were exponen- 
tiated 2(— AACt) to be expressed as n-fold changes in regulation compared with 
the experimental control (2(—AACt) by the method of relative quantification’. 
Animal experiments. Experiments using FVB and Balb/c (Charles River Laboratory 
and Jackson Laboratory, respectively) mice were carried out according to Swiss 
national guidelines on animal welfare and the regulations of the Canton of Basel Stadt. 

To induce spontaneous metastasis, 1-4 x 10° tumour cells of different lines 
were injected into the fourth mammary fat pad of 7- to 8-week-old female FVB 
or Balb/c mice. Antibody treatments were performed 3-15 days after tumour cell 
injection (depending on the cell line used) via intraperitoneal injection 3 times per 
week. Anti-CCL2, anti-IL6 and anti-VEGF antibodies or the respective controls 
were used at 1 mgkg '. The CFS1R inhibitor (200 mg kg * oral dosing) and the 
corresponding vehicle were administered daily. 

Tumour volume was calculated using the formula: length x width/2. To assess 
the number of CTCs, blood was taken from the right atrium, plated in DMEM medium 
supplemented with 10% FCS, and colonies counted on day 10 of culture. The number 
of CTCs was calculated as the total number of colonies in the dish divided by the 
volume of blood taken. To quantify metastases, the lungs were fixed in Bouin’s 
solution (Sigma Aldrich) for macrometastases or in PFA 4% for micrometastases. 

For adoptive transfer of monocytes, tumour-bearing animals were killed on day 
14 after orthotopic injection of cancer cell lines. ssc’“CD1 1b*Ly6C* monocytes 
were sorted from femoral bone marrow into PBS and 1% FCS, stained with the 
vital dye CMTMR (5-(and-6)-(((4-chloromethyl)benzoyl)amino)tetramethylrho- 
damine; Invitrogen) for 10 min at 37 °C according to the manufacturer’s protocol. 
Aliquots of 8 X 10° labelled cells were injected into the tail vein of syngeneic tumour- 
bearing mice that were treated with anti-CCL2 or the IgG control by intraperito- 
neal injection of fresh antibodies diluted in PBS (10 mg kg’). Mice were killed 1 day 
later and single-cell suspensions of the lungs, blood, bone marrow and primary 
tumours were counterstained with CD11b-APC and analysed by flow cytometry. To 
assess the effect of monocytes on metastases, 1 X 10° sorted SSC®“CD11b* Ly6C* 
cells from tumour-bearing animals were injected into the tail vein of recipient 
syngeneic tumour-bearing animals daily for 4 consecutive days. The mice were 
killed on day 11 after tumour cell injection and CTCs and metastases were assessed 
as described earlier. 

Intravital imaging. Orthotopic mammary tumours were grown for 1-2 weeks. 
Mice were anaesthetized with Attane Isofluran (Provet AG) and mounted on a 
custom-made stage. Anaesthesia was maintained throughout the experiment with 
a nose cone. Tumours were exposed by skin flap surgery** on a custom-made multi- 
photon microscope’* and imaged at 880 nm with a X25/1.05NA water immersion 
objective (Olympus) at a resolution of 1.06 1m per pixel. Cell motility was observed 
by time-lapse imaging over 30 min in 2-min cycles, where a 100 ym z-stack at 
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5-um increments was recorded for each frame starting at the tumour capsule. 
Three-dimensional time-lapse videos were analysed using Image J””. Tumour cell 
motility was quantified manually. A tumour cell motility event was defined as a 
protrusion of half a cell length or more over the course of a 30 min video. For 
visualizing vasculature, 100 ll of 20 mg ml’ 70 kDa Texas Red-dextran (Invitrogen, 
Molecular Probes) was injected into the tail vein of the mice before surgery and 
imaging of blood vessels and macrophages. Vasculature was quantified manually 
as tubes of at least 20 jum length appearing in the red channel. Permeability of the 
blood vessels was measured as the time taken by dextran to leak out from the blood 
vessels as reported previously”. 

Cell invasion. Invasion assays were performed using transwell chambers (8 um 
pore size, BD Biocoat Growth factor-reduced Matrigel Invasion Chamber) accord- 
ing to the manufacturer’s protocol. An aliquot of 5 X 10° cells in 100 pl of DMEM 
0.1% BSA was plated into the top chamber and DMEM 10% FBS alone or contain- 
ing monocytes derived from bone marrow of animals bearing autologous tumours 
was placed in the bottom chamber. After a 24h incubation at 37 °C and 5% COp, 
cells at the lower surface of the membrane were fixed with 3.7% paraformaldehyde, 
stained with 0.2% crystal violet, and washed with 1 PBS. The number of cells per 
field was quantified microscopically and total cell number was evaluated by trypan 
blue vital. 

Statistical analysis. All the in vitro experiments were performed in biological and 
technical triplicates. The number of mice was calculated by performing power 
analysis using data from small pilot experiments. Values represent the means + s.e.m. 
Depending on the type of experiment, data were tested using two-tailed Student’s 
t-test, log-rank test, or one-way ANOVA with post-hoc Bonferroni correction. 
*P < 0.05 and **P < 0.001 were considered statistically significant. 
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Immunohistochemistry on human samples was analysed with R/bioconductor* * 
using a one-way ANOVA (assuming that the data are normally distributed). The 
different box plots represent the first to the third quartile of the data. The thick line 
is the median, and the whiskers in the box plot extend to the minimal and maximal 
values, after outliers have been removed. Outliers were defined by default as all data 
points that are more than 1.5-fold the box length (the interquartile range) away 
from the median. 
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Extended Data Figure 1 | CCL2 is overexpressed in breast cancer. a, Positive supernatants of murine breast cancer cell lines grown in vitro. Data are shown 


correlation between immunohistochemical staining score of CCL2 and the as means = s.e.m., n = 3 wells, one representative of three independent 
number of CD68-positive macrophages per high-power field in human experiments. d, CCL2 expression in sera of mice bearing the indicated tumours 
biopsies (n = 30). P = 0.024; data were analysed using R with one-way for 14 days. Data are shown as means + s.e.m., n = 3 animals per group, one 
ANOVA (assuming that the data are normally distributed). The box plot representative of two independent experiments. e, CCL2 levels in sera (left) 
represents the first to the third quartile of the data. Thick line indicates median, and lung homogenate (right) from 4T1.2-tumour-bearing mice during 

the whiskers extend to the minimal and maximal values. b, Kaplan-Meier anti-CCL2 or IgG treatment or 10 days after the cessation of treatment. Data are 
survival curves showing overall survival of breast cancer patients with high or shown as means + s.e.m., n = 5 mice per group, pooled data from two 

low CCL2 expression. P = 0.0372, log-rank test. c, CCL2 expression in independent experiments. *P < 0.05, unpaired t-test. 
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Extended Data Figure 2 | Anti-CCL2 treatment reduces breast cancer 
metastases. a, Tumour volumes on day 11 of treatment with anti-CCL2 or IgG. 
Data are shown as mean tumour volumes = s.e.m., n = 10 mice, pooled data 
from two independent experiments. NS, not significant (unpaired t-test). 

b, Representative images of haematoxylin and eosin (H&E)-stained lung 
sections out of 20 images per group. Black circles indicate metastases. Scale bar, 
500 um. c, Quantification of metastases as percentage of metastatic area per 
lung area (left) or as metastatic area per lung section (right). Data are shown as 
means ~ s.e.m. of 20 fields of view on 5 sections per animal, n = 4 mice per 
group, pooled data from two experiments. *P < 0.05, unpaired t-test. d, Effect 
of anti-CCL2 and IgG treatment on CTCs from tumour-bearing animals on 
day 11 of treatment. Data are shown as means + s.e.m. of 20 fields of view on 5 
sections per animal, n = 4 mice per group, pooled data from two experiments. 
*P < 0.05, unpaired t-test. e, Intravital multiphoton images of 4T1-GFP 
primary tumours on day 7 of treatment with anti-CCL2 antibody or IgG. Left, 
representative two-dimensional images out of 24 fields from primary tumours 


(green) 50 um below the tumour surface, 24 min after intravenous injection of 
70kDa Texas Red-dextran (red). Scale bar, 100 jum. Right, mean numbers of 
motile tumour cells + s.e.m. per 30 min acquisition, 6 fields per mouse, n = 4 
mice per group. *P < 0.05, unpaired t-test. f, Intravital multiphoton images of 
vasculature in 4T1-GFP primary tumours on day 7 of treatment. Left, 
representative two-dimensional images out of 12 fields per group, 50 um below 
the tumour surface, 15 min after intravenous injection of 70 kDa Texas Red-— 
dextran (red). Scale bar, 100 jum. Right, mean number of dextran-positive blood 
vessels counted over 42 min, starting 15 min after injection of Texas Red- 
dextran. Results are shown as means + s.e.m., 1 = 4 mice per group and 3 fields 
per animal. *P < 0.05, unpaired t-test. g, Left, representative images out of 

12 images per group from %-SMA1-stained 4T1.2 tumour sections. Scale bar, 
500 pm. Right, quantification of the perimeter of blood vessels covered by 
a-SMAI staining. Data are shown as percentage vessel perimeter covered by 
a-SMA1 + s.e.m. of 12 fields of view on 4 sections per animal, n = 3 mice per 
group, pooled data from two experiments. *P < 0.05, unpaired t-test. 
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Extended Data Figure 3 | Macrophages decline in primary 4T1.2 

tumours upon treatment with anti-CCL2. a, Left, total number of 
SSC°"CD11b* CD11c*CD206* MHCII* macrophages. Right, percentage of 
CD11c*CD206* MHCII* macrophages per SSC°“CD11b™ cell population. 
Data are shown as means + s.e.m., n = 3 mice. *P < 0.05, unpaired t-test. 

b, Left, total number of Texas-Red-positive sscl’CD11b* macrophages. 
Right, percentage of Texas-Red-positive macrophages per SSC°"CD11b* cells, 


enumerated by flow cytometry of 4T 1.2 primary tumours from tumour-bearing 
mice treated for 14 days, after intravenous injection of Texas Red-dextran. Data 
are shown as means = s.e.m., 1 = 6 mice pooled data of two independent 
experiments. *P < 0.05, unpaired t-test. c, Concentration of hamster IgG in 
serum, lung homogenates and tumour during treatment (day 14) and at 10 days 
after cessation of treatment as detected by ELISA. Data are shown as 

means + s.e.m., n = 4 mice per group. *P < 0.05, **P < 0.001, unpaired t-test. 
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Extended Data Figure 4 | Cessation of anti-CCL2 treatment increases lung 
metastases. a, Top, representative H&E-stained lung sections from 4T1.2- and 
Met-1-tumour-bearing mice 10 days after cessation of treatment. Scale bar, 
500 um. Bottom, quantification of lung metastases as percentage of metastatic 
area per lung area and metastatic area per lung section. Data are shown as 
means + s.e.m. of 20 fields of view on 5 sections per animal, n = 4 mice per 
group, pooled data from two experiments. *P < 0.05, unpaired t-test. One 
representative image out of 20 is shown per group. b, Left, representative H&E- 
stained liver sections from 4T1.2-tumour-bearing mice 10 days after cessation 
of treatment. Scale bar, 500 um. Right, quantification of liver metastases as 
metastatic area per lung section. Data are shown as means ~ s.e.m. of n = 10 
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fields of view on 5 sections per animal, n = 2 mice per group. *P < 0.05, 
unpaired t-test. One representative image out of 10 is shown per group. 

c, d, 4T1.2-tumour-bearing animals were treated for 24 consecutive days with 
anti-CCL2 or IgG control. ¢, Left, representative images of H&E-stained lung 
sections. Scale bar, 500 jum. Right, lung metastases were quantified as 
percentage of metastatic area per lung area on day 24 of treatment. Data are 
shown as means + s.e.m. of 20 fields of view on 5 lungs sections per animal, 
n= 4 mice per group. *P < 0.05, unpaired t-test. One representative image out 
of 20 is shown per group. d, Number of colonies formed by CTCs per ml blood 
collected on day 24 of treatment. Data are shown as means + s.e.m. of n = 4 
mice per group. *P < 0.05, unpaired t-test. 
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Extended Data Figure 5 | CCL2 drives mutual attraction of monocytes and 
tumour cells. a, Fluorescence-activated cell sorting (FACS) plots showing the 
gating strategy for identification and isolation of inflammatory monocytes. 

b, Transwell invasion assay of Met-1 cells (left) or 4T1 cells (right) on top of 
matrigel towards Ly6C* monocytes (Mono). Data are shown as means of 
invading cell numbers + s.e.m., n = 4 independent experiments with each 
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experiment performed in triplicate. *P < 0.05, ANOVA with Bonferroni post- 
test. c, Transwell invasion assay of Ly6C* monocytes on top of matrigel 
towards 4T1 (left) or Met-1 (right) cells. Data are shown as means of invading 
cell numbers + s.e.m., n = 3 independent experiments with each experiment 
performed in triplicate. *P < 0.05, ANOVA with post-hoc Bonferroni 
correction. 
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Extended Data Figure 6 | Distribution of inflammatory monocytes assessed 
by flow cytometry during and after treatment in vivo. a, Top, quantification 
of Ly6C*Gr1* monocytes as percentage of SSC'°“CD11b* cells on day 14 of 
anti-CCL2 or IgG treatment and 10 days after treatment interruption in 4T1.2- 
tumour-bearing animals. Data are shown as means + s.e.m., n = 8 mice, pooled 
data from two independent experiments. *P < 0.05, unpaired t-test. Bottom, 
quantification of total number of monocytes on day 14 of anti-CCL2 or IgG 
treatment and 10 days after treatment interruption in 4T1.2-tumour-bearing 
animals. Data are shown as means + s.e.m., n = 8 mice, pooled data from two 
independent experiments. *P < 0.05, unpaired t-test. b, Top, quantification of 
monocytes as Ly6C* Gr1* monocytes as percentage of SSC’°"CD11b* cells on 
day 14 of anti-CCL2 or IgG treatment and 10 days after treatment interruption 
in Met-1-tumour-bearing animals. Data are shown as means + s.e.m., n = 8 
mice, pooled data from two independent experiments. *P < 0.05, unpaired 


t-test. Bottom, quantification of total number of monocytes on day 14 of anti- 
CCL2 or IgG treatment and 10 days after treatment interruption in 4T1.2- 
tumour-bearing animals. Data are shown as means + s.e.m., n = 8 mice, pooled 
data from two independent experiments. *P < 0.05, unpaired t-test. c, Top left, 
representative images of Ibal-stained lung and tumour sections on day 14 of 
treatment. Bottom left, quantification of Ibal staining. Total numbers of Ibal- 
positive cells are shown = s.e.m. of 10 fields of view on 3 lungs sections per 
animal, n = 3 mice per group. Top right, representative images of Ibal-stained 
lung and tumour sections 10 days after cessation of the treatment. Bottom right, 
quantification of Ibal staining. Total numbers of Ibal-positive cells are shown 
+ s.e.m. of 10 fields of view on 3 lung sections per animal, n = 3 mice per group. 
*P < 0.05, ANOVA with post-hoc Bonferroni correction. Scale bar, 100 pum. 
One representative image out of 10 is shown per group. NS, not significant. 
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Extended Data 1 Figure 7 | Recruitment of adoptively transferred CMTMR- 
labelled CCR2* monocytes to the primary tumour and metastatic lung 
depends on CCL2. a, Quantification of adoptively transferred CMTMR™ 
monocytes as total numbers or percentages of SSC'°"CD11b* cells in different 
organs from tumour-bearing animals on day 7 of treatment. BM, bone marrow. 
Data are shown as means ~+ s.e.m., 1 = 5 mice per group, one representative of 


Blood Lungs Tumor 


two independent experiments. *P < 0.05, unpaired t-test. b, Quantification of 
adoptively transferred CMTMR™ monocytes as total numbers or percentages 
of SSC’“CD11b* cells in different organs from tumour-bearing animals on 
day 7 after interruption of the treatments. Data are shown as means + s.e.m., 
n= 3 mice per group, one representative of two independent experiments. 
*P < 0.05, unpaired t-test. NS, not significant. 
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Extended Data Figure 8 | Monocytes support growth of tumour cells in vivo 
and in vitro. a, Timeline of the experiments. b, Left, metastatic index in 
lungs from 4T1.2-tumour-bearing animals on day 24. Data are shown as 
means + s.e.m., m = 4 mice per group, one representative of two independent 
experiments. *P < 0.05, unpaired t-test. Scale bar, 100 jm. Right, the number 
of tumour cell colonies per ml blood of tumour-bearing animals on day 24 
and after removal of the primary tumour. Data are shown as means ~ s.e.m., 
n = 4 mice per group, one representative of two independent experiments. 
*P < 0.05, unpaired t-test. c, Left, representative images of Ibal-stained lung 
sections in animals treated as in Extended Data Fig. 8a. Right, quantification of 
Ibal staining on day 14 (last day of treatment and the day of tumour removal) 


and on day 24 (10 days after stopping treatment and tumour removal). 

Total numbers of Ibal-positive cells are shown = s.e.m. of 10 fields of view on 
3 lungs sections per animal, n = 3 mice per group. *P < 0.05, unpaired t-test. 
Scale bar, 100 jim. One representative image out of 10 is shown per group. 

d, Quantification of 4T1.2 cell viability using Trypan Blue 48 h after co-culture 
with sorted monocytes (Mono). Data are shown as means + s.e.m., n = 3. 

*P < 0.05, unpaired t-test. e, Representative images of Ibal, CD31 and 

Ki67 staining of lung sections from animals treated with CSF1R inhibitor 
(CSF1Rinh) or vehicle (vhc) after treatment with anti-CCL2 or IgG control. 
One representative image out of 10 is shown per group. 
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Extended Data Figure 9 | Upregulation of CCL2 and IL-6 reduces 

overall survival in breast cancer patients. a, Correlation between 
immunohistochemical score of CCL2 and number of CD31~ vessels per 
high-power field in biopsies from patients with breast carcinoma (n = 17), 
P= 0.031. Data were analysed using R with one-way ANOVA (assuming that 
the data are normally distributed). The box plot represents the first to the third 
quartile of the data. Thick line indicates median, the whiskers in the box plot 
extend to the minimal and maximal values. b, Cytokine array showing 
expression of IL-6 in the supernatant of mono- or co-cultures of 4T1.2 cells and 
primary monocytes, in the presence or absence of anti-CCL2. c, Quantification 
of IL-6 expression in cell supernatants of monocytes in mono- or co-culture 
with 4T1.2 by ELISA. Data are shown as means + s.e.m., n = 4 independent 
experiments each with biological triplicates. *P < 0.05, ANOVA with post-hoc 
Bonferroni correction. d, IL-6 expression levels in primary tumours (left) or in 
serum (right) on day 14 of treatment and after cessation of treatment (day 24). 
Data are shown as means + s.e.m., 1 = 6 mice per group, pooled data from two 


independent experiments. *P < 0.05, ANOVA with post-hoc Bonferroni 
correction. e, Left, Kaplan-Meier survival curves showing overall survival of 
400 breast tumour patients expressing either low (blue) or high (red) IL-6 levels. 
*P < 0.045, log-rank test. Right, Kaplan-Meier survival curves showing overall 
survival of 148 patients with tumours expressing either CCL2 high/IL-6 low 
(blue) or CCL2 high/IL-6 high (red) levels. *P < 0.0017, log-rank test. f, Left, 
quantification of lung metastases. Metastatic area per lung section is shown 
+ s.e.m. of 20 fields of view on 5 lungs sections per animal, n = 4 mice per 
group. *P < 0.05, ANOVA with post-hoc Bonferroni correction. Right, 
number of colonies formed by CTCs per ml blood in animals treated as in 
Fig. 3c on day 24. Data are shown as means + s.e.m., n = 10 mice per group, 
pooled data from two experiments. *P < 0.05, ANOVA with post-hoc 
Bonferroni correction. g, VEGF-A expression as measured by ELISA in serum 
from 4T1.2-tumour-bearing animals treated as in Fig. 3c. Data are shown as 
means + s.e.m., n = 4 mice per group, pooled data from two experiments. 

*P < 0.05, ANOVA with post-hoc Bonferroni correction. NS, not significant. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


After IgG+CSF1Rinh . 


After anti-CCL2+CSF1Rinh 


Extended Data Figure 10 | Monocytes/macrophages induce p-STAT3 in 
metastatic lungs upstream of VEGF-A expression. a, Left, representative 
images of p-STAT3-stained lung sections from animals treated as in Fig. 4a. 
Scale bar, 100 um. Right, quantification of p-STAT3 staining. Total numbers of 
p-STAT3-positive cells per field are shown + s.e.m. of 10 fields of view on 3 
lungs sections per animal, n = 3 mice per group. b, Left, representative images 
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of p-STAT3-stained lung sections from animals treated as in Fig. 2c. Scale bar, 
100 um. Right, quantification of p-STAT3 staining. Total numbers of 
p-STAT3-positive cells per field are shown = s.e.m. of 10 fields of view on 3 
lungs sections per animal, n = 3 mice per group. CSF1Rinh, CSF1R inhibitor; 
vhe, vehicle. *P < 0.05, ANOVA with post-hoc Bonferroni correction. a, b, One 
representative image out of 10 is shown per group. 
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Tumour -infiltrating Gr-1* myeloid cells antagonize 


senescence in cancer 


Diletta Di Mitri!*, Alberto Toso’, Jing Jing Chen”, Manuela Sarti!, Sandra Pinton!, Tanja Rezzonico J ost’, Rocco D’ Antuono’, 
Erica Montani?, Ramon Garcia-Escudero', Ilaria Guccini!, Sabela Da Silva-Alvarez”, Manuel Collado”, Mario Eisenberger®, 
Zhe Zhang’, Carlo Catapano’, Fabio Grassi? & Andrea Alimonti!? 


Aberrant activation of oncogenes or loss of tumour suppressor genes 
opposes malignant transformation by triggering a stable arrest in 
cell growth, which is termed cellular senescence’*. This process is 
finely tuned by both cell-autonomous and non-cell-autonomous 
mechanisms that regulate the entry of tumour cells to senescence*®. 
Whether tumour-infiltrating immune cells can oppose senescence 
is unknown. Here we show that at the onset of senescence, PTEN null 
prostate tumours in mice””’ are massively infiltrated by a population 
of CD11b*Gr-1* myeloid cells that protect a fraction of proliferat- 
ing tumour cells from senescence, thus sustaining tumour growth. 
Mechanistically, we found that Gr-1* cells antagonize senescence in 
a paracrine manner by interfering with the senescence-associated 
secretory phenotype of the tumour through the secretion of inter- 
leukin-1 receptor antagonist (IL-1 RA). Strikingly, Pten-loss-induced 
cellular senescence was enhanced in vivo when Il1ra knockout mye- 
loid cells were adoptively transferred to PTEN null mice. Therapeu- 
tically, docetaxel-induced senescence and efficacy were higher in PTEN 
null tumours when the percentage of tumour-infiltrating CD11b* Gr- 
1* myeloid cells was reduced using an antagonist of CXC chemokine 
receptor 2 (CXCR2)*. Taken together, our findings identify a novel 
non-cell-autonomous network, established by innate immunity, that 
controls senescence evasion and chemoresistance. Targeting this net- 
work provides novel opportunities for cancer therapy. 

Cellular senescence is a stable state of cell growth arrest that opposes 
tumour initiation and progression in a variety of in vivo tumour models'”. 
Recent studies have revealed an unexpected role for both adaptive and 
innate immunity in the regulation of cellular senescence. Immune cells 
can either clear senescent cells from tumours or induce senescence in can- 
cer cells by secreting pro-inflammatory cytokines” ''. However, whether 
tumour-infiltrating immune cells can also oppose senescence in vivo is 
not known. We have previously shown that complete inactivation of the 
tumour suppressor gene Pten in the mouse prostate epithelium induces 
the formation of benign tumours characterized by a strong senescence 
response that opposes tumour progression*’. However, these tumours 
grow over time and progress to become more aggressive and invasive 
tumours”””. Indeed, at the onset of senescence (7-8 weeks), PTEN null 
tumours (hereafter referred to as Pten?“—'~ tumours) are characterized 
by the concomitant presence of both senescent and proliferative cellular 
compartments, as shown by the expression of p16" “*“ (also known as 
CDKN2A), pHP1y (also known as CBX3) and senescence-associated 
B-galactosidase (SA-B-gal) and by Ki-67 positivity, respectively (Fig. la 
and Extended Data Fig. 1a—c). This finding suggests that from the early 
stages of tumorigenesis, a fraction of proliferating tumour cells evades 
senescence. Given the interplay between senescent tumour cells and 
immune cells, we speculated that Pten?“—'~ tumours may evade senescence 


in a non-cell-autonomous manner and that the tumour microenviron- 
ment could be the source of factors that hinder the senescence response. 

To address this hypothesis, we first characterized the immune micro- 
environment of Pten~‘~ tumours at the onset of senescence and found 
a strong infiltration of CD45"CD11b*Gr-1* myeloid cells (hereafter 
referred to as Gr-1° cells) (Fig. 1b and Extended Data Fig. 1d-f). More- 
over, the prostate lobes with the highest percentage of Ki-67 staining 
were the most infiltrated by Gr-1" cells (Extended Data Fig. 1g). To 
study the localization of tumour-infiltrating Gr-1* cells in the prostate, 
we adoptively transferred bone marrow precursors from mice transgenic 
for green fluorescent protein (GFP) under control of the human ubiquitin 
C (UBC) promoter’ into lethally irradiated Pten®°~'~ mice (Extended 
Data Fig. 1h). Immunofluorescence showed that GFP™ cells localized to 
the stroma and the prostate glands (Fig. lc and Extended Data Fig. li, j). 
Moreover, GFP* cells were localized in close proximity to proliferating 
(Ki-67*) tumour cells (Fig. lcand Extended Data Fig. 1)). Immune cells 
secrete a variety of cytokines that regulate senescence"’. Interestingly, 
the majority of GFP” cells were spatially distributed within 100 jum of 
Ki-67* epithelial cells (Fig. 1c), suggesting that GFP” epithelial cells may 
interfere with senescence in a paracrine manner’*”’. Notably, ~70% of 
the tumour-infiltrating GFP” cells expressed the myeloid differentia- 
tion antigen Gr-1 (Fig. 1d and Extended Data Fig. 1k). We also confirmed 
these results in tumour sections from non-irradiated Pten’*‘~ mice’® 
(Extended Data Fig. 11). To assess whether factors secreted by Gr-1~ cells 
can oppose Pten-loss-induced cellular senescence, we cultured Pten /~ 
mouse embryonic fibroblasts (MEFs), which undergo senescence in vitro””, 
in the presence of conditioned medium obtained from Gr-1* myeloid 
cells'” (Extended Data Fig. 1m, n). Surprisingly, Pten-loss-induced cel- 
lular senescence was impaired in Pten ‘~ MEFs cultured in the pres- 
ence of conditioned medium from Gr-1* cells (Fig. leand Extended Data 
Fig. 10). These data demonstrate that Gr-1~ cells oppose senescence in 
a paracrine manner. 

To identify secreted factors that mediate the anti-senescence function 
of Gr-1* cells, we compared the cytokine profile of Pten’“ ‘~ tumours 
before and after depletion of immune cells'* and found that IL-1RA was 
the cytokine present at the most reduced level after immunodepletion 
(Extended Data Fig. 2a and Supplementary Table 1). Gene expression 
analysis, quantitative PCR with reverse transcription (qRT-PCR) and 
immunofluorescence confirmed that Gr-1* cells were the major source 
of IL-1RA in Pten’*~/~ tumours relative to epithelia (Fig. 2a, b and 
Extended Data Fig. 2b). It should also be noted that CD11b* Gr-1” F4/ 
80° cells released IL-1RA in the tumour microenvironment (Extended 
Data Fig. 2c). IL-1 RA is an antagonist of IL-1R and has been reported to 
impair oncogene-induced senescence in vitro by blocking IL-10-mediated 
signalling®°. Notably, in Pten’*-‘~ tumours, the prostate lobes with the 
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Figure 1 | Gr-1* myeloid cells oppose Pten-loss-induced cellular 
senescence. a, Haematoxylin and eosin (H&E), Ki-67 and plo N<*4 
immunohistochemical staining (Ki-67 and plo NS4, blue; nuclei, brown) (left) 
and quantification (right) in tumours from 8-week-old Pten?**'* and 
Pten?*—'~ mice (n = 15 mice, 1 tumour assessed per mouse, 3 sections assessed 
per tumour, =5 fields assessed per section). Original magnification, x 400. 

b, Flow cytometry plots of CD11b*Gr-1* immune cells in tumours from 
8-week-old Pten’“/* and Pten’“'~ mice (n = 6), gating on CD45" cells. 

c, Confocal images (left) and quantification (right) of the localization and 
distance between tumour- infiltrating myeloid cells (GFP*, green) and 
proliferating epithelial cells (cytokeratin 18 (CK18), grey; Ki-67, red) in 
Pten®*~'~ UBC-GEP prostate lesions (nuclei, blue (DAPI) (n = 4 mice, 1 
tumour per mouse, 5 fields acquired, 320 cells measured). The arrow heads 
point to positions where UBC-GFP* cells colocalize in close proximity to Ki- 
67° cells. Scale bar, 10 jum. d, Representative confocal images of UBC- 

GFP‘ Gr-1* cells in Pten?“ '~ UBC-GEFP prostate lesions (nuclei, blue (DAPI). 
Scale bar, 5 um. e, Quantification of SA-B-gal staining in Pten*'* and Pten ‘~ 
cells (n = 5). CM, conditioned medium; UT, untreated. f, Cell proliferation 
of Pten*'* and Pten~‘~ cells (fold change compared with UT Pten?t!*) 
(n=5).a, ¢, e, f, Error bars, mean + s.e.m. P values were derived from an 
unpaired, two-tailed Student’s t-test (*P < 0.05; ***P < 0.001). 


highest percentage of Gr-1* myeloid cells (Extended Data Fig. 1g) dis- 
played the highest levels of IL-1RA and the lowest levels of p16'\“** 
(Extended Data Fig. 3a). To elucidate the role of IL-1u-mediated signal- 
ling in Pten-loss-induced cellular senescence, we cultured Pten~'~ MEFs 
in the presence of IL-1RA. While IL-1 was strongly expressed in senes- 
cent Pten '~ cells, unlike in Pten*'* cells, the expression of IL-1RA was 
slightly higher than in Pten*’* cells (Fig. 2c). Remarkably, treatment 
with IL-1RA decreased both SA-B-gal staining and levels of the tumour 
suppressor protein p53 in Pten ‘~ cells (Fig. 2d, e). In-vitro-polarized 
Gr-1* myeloid cells expressed high levels of Ira (Extended Data Fig. 3b). 
When Pten ‘~ MEFs were cultured in the presence of conditioned med- 
ium from Gr-1* myeloid cells derived from Il1ra knockout (Illra~'~) 
mice’*”®, Pten-loss-induced cellular senescence was not impaired (Fig. 2f, g). 
Similar results were obtained in MEFs transfected with Ha-ras’ ? (H- 
ras), suggesting that myeloid cells also oppose oncogene-induced senes- 
cence (Extended Data Fig. 3c, d). Moreover, conditioned medium from 
Gr-1* cells pre-treated with a JAK2 inhibitor failed to block Pten-loss- 
induced cellular senescence’! (Extended Data Fig. 3e, f). Importantly, 
IL-1RA also blocked docetaxel-induced senescence” in human prostate 
cancer cells (Extended Data Fig. 4a, b). 
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Figure 2 | Gr-1* myeloid cells oppose Pten-loss-induced cellular senescence 
by interfering with IL-1a signalling in vitro. a, mRNA levels of secreted 
factors (n = 3 per group). b, Representative confocal images of GFP myeloid 
cells co-expressing Gr-1 (red) and IL-1RA (grey) (nuclei, blue (DAPI)). Scale 
bar, 5 um. c, I]Ja and Il1ra mRNA levels in Pten*'* and Pten’/~ MEFs (n = 3). 
d, Percentage of SA-B-gal* cells in Pten*/* and Pten”'~ MEFs treated with 
recombinant IL-1RA (n = 4). e, Western blot for p53 in MEFs treated with 
recombinant IL-1RA. Numbers indicate fold changes in protein level. 

f, Percentage of SA-B-gal* cells in Pten*’* and Pten”'~ MEFs cultured in the 
presence of conditioned medium from IlIra*’* Gr-1* or Illra~‘ Gr-1* cells 
(n = 5). g, Western blot for p53. Numbers indicate fold changes in protein level. 
a, c, d, f, Error bars, mean = s.e.m. P values were derived from an unpaired, 
two-tailed Student’s t-test (*P < 0.05; **P < 0.01). 


We next validated our findings in a mouse model of oncogene-induced 
senescence. Ki-ras*/@'?Y mice develop both lung adenomas and ade- 
nocarcinomas that display stage-dependent expression of senescence 
markers”. Interestingly, lung adenocarcinomas were characterized by 
the infiltration of Gr-1* myeloid cells, IL-1RA expression and Ki-67 
positivity and the absence of senescence markers (Extended Data Fig. 5a). 
By contrast, senescent lung adenomas were poorly infiltrated by mye- 
loid cells (Extended Data Fig. 5a—c). These data suggest that myeloid cells 
may oppose senescence in different types of tumour, irrespective of the 
genetic background. To validate our findings in vivo, we adoptively trans- 
ferred bone marrow precursors from IlIra*'* or Illra~’~ mice into 
lethally irradiated Pten’“'~ mice (yielding Pten”~ '" Illra*’* mice and 
Pten”“'~Il1ra~'~ mice) (Fig. 3a and Extended Data Fig. 6a). Notably, 
Pten?“'~ Il1ra*'* and Pten’'~Il1ra‘~ tumours were infiltrated equally 
by Gr-1* cells (Fig. 3b). Strikingly, histopathological analysis revealed 
that Pten’“'~Illra~'~ tumours displayed an almost complete normal- 
ization of glands affected by prostatic intraepithelial neoplasia (Fig. 3c, d), 
which was associated with decreased cell proliferation, increased senes- 
cence and absence of apoptosis (Fig. 3c, e and Extended Data Fig. 6b-h), 
in contrast to Pten’*‘~ Il1ra*'* tumours. Finally, the enhanced senes- 
cence response in Pten’*~/~IlIra-‘~ tumours was associated with the 
activation of IL-1u-mediated signalling? (Fig. 3f). Myeloid cells were adop- 
tively transferred to Pten?*~'~ mice in the absence of T cells (Extended 
Data Fig. 7a, b), suggesting that Gr-1* myeloid cells oppose senescence 
through a novel pro-tumorigenic mechanism that does not involve this 
cell population*”*. These results demonstrate that Gr-1* myeloid cells 
antagonize senescence in vivo ina paracrine manner by interfering with 
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Figure 3 | Adoptively transferred IlIra_‘~ bone marrow precursors 
enhanced Pten-loss-induced cellular senescence in vivo. a, Experimental 
set-up. NK, natural killer. b, Flow cytometry plots showing equal infiltration of 
Gr-1* cells in Pten~/~Il1ra*'* and Pten”*-'~Il1ra_‘~ mice. c, H&E, Ki-67 
and p16"\“*“ immunohistochemical staining of sections from 12-week-old 
Pten?“'~Tl1ra‘!* and Pten’*/~Illra~'~ mice. Ki-67 and p16'““““, brown; 
nuclei, blue. Original magnification, x 400. d, Quantification of prostatic 
intraepithelial neoplasia (PIN)-affected glands in Pten?™'~Il1ra‘’* and 
Pten®“'~Il1ra~'~ mice. e, Confocal images of staining for CK18 (grey) and 
pHP1y (green) in prostate tumours from the indicated genotypes (nuclei, blue 
(DAPI)). Percentage of cells that stained positive for pHP1y, 15 = 7% (bottom 
left) and 37 + 13% (bottom right). Original magnification, x 400. f, IL-1 
signalling. Fold change in the expression of IL-1 target genes in tumours of the 
indicated genotypes. d, f, n = 4; 1 tumour per mouse; 3 sections per mouse; 
=5 fields per section. Error bars, mean + s.e.m. P values were derived from an 
unpaired, two-tailed Student’s t-test (*P < 0.05; **P <0.01). 


IL-1a-mediated signalling. Moreover, Pten*'~ tumours from mice 


treated with IL-1a showed a significant reduction in the percentage of 
glands affected by prostatic intraepithelial neoplasia and the number 
of Ki-67~ cells anda strong increase in p16'\“*“ expression (Extended 
Data Fig. 8a-e). This finding suggests that IL-1 mainly plays a tumour 
suppressive role in Pten-loss-induced cellular senescence. 

Different types of chemotherapy drug are known to induce senescence 
in tumours”. Therefore, we reasoned that chemotherapy-induced senes- 
cence could also be weakened by tumour-infiltrating Gr-1* myeloid 
cells. Notably, the chemokines CKCL1 and CXCL2, which act through 
the chemokine receptor CXCR2 to recruit Gr-1* myeloid cells*, were 
strongly upregulated in Pten’* /~ tumours (Supplementary Table 1 and 
Extended Data Fig. 8f). We next combined docetaxel treatment, which 
drives senescence in tumours”, with an antagonist of CXCR2 to block 
the recruitment of Gr-1* myeloid cells to Pten”~ '~ tumours (Extended 
Data Fig. 8g-j). Our pre-clinical study showed that treatment with the 
CXCR2 antagonist synergized with docetaxel. Indeed, in mice treated 
with the CXCR2 antagonist and docetaxel, we observed a strong anti- 
tumour response and a concomitant reduction in the IL-1RA levels 
(Fig. 4a, b and Extended Data Fig. 9a). These changes were associated 
with an enhanced senescence response, reduced proliferation and the 
absence of apoptosis (Fig. 4c-e and Extended Data Fig. 9b-d). Notably, 
treating Pten “ MEFs with the CXCR2 antagonist did not affect senes- 
cence (Extended Data Fig. 9e, f). Next, we assessed the levels of IL-1RA 
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Figure 4 | Impaired recruitment of Gr-1* myeloid cells enhanced 
chemotherapy-induced senescence and chemotherapy efficacy in Pten” 
tumours: relevance for human cancer. a, b, Tumour volume and 
quantification of PIN-affected glands in Pten’“'~ mice treated with 
docetaxel or a CXCR2 antagonist (CXCR2a). c-e, Ki-67 and plo N44 
immunohistochemical staining (Ki-67 and plo", brown; nuclei, blue) 
(c) and Ki-67 (d) and plo N*4 (e) quantification in Pten?*—'~ mice treated 
with docetaxel, CXCR2a or both. Scale bar, 10 tum. f, CD33 and ploNX‘4 
immunohistochemical staining in consecutive tissue microarray (n = 92) 
sections of human prostate cancer from two donors. CD33 and p16“, 
brown; nuclei, blue. Main images, magnification X 400; insets, magnification 
x 100. g, pl6™ 4 and CD33 immunohistochemical staining in sections of 
PIN and prostate cancer (PCA) from the same donor. CD33 and p16“, 
brown; nuclei, blue. Main images, magnification < 400; insets, Ki-67 positive 
cells in the same region at the same magnification. a, b, d, e, n = 5 control 
group; n = 7 treated groups; 1 tumour per mouse; 3 sections per mouse; 
=5 fields per section. Error bars, mean + s.e.m. P values were derived from an 
unpaired, two-tailed Student’s t-test (*P < 0.05; **P < 0.01; ***P < 0.001). 
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in primary tumours from patients with high-risk, localized prostate cancer 
who received docetaxel after prostatectomy in a prospective multicen- 
tre trial”*. Interestingly, patients with high levels of intratumoral IL- 
1RA did not respond to docetaxel and had a short disease-free survival 
(15 + 10 months, mean + s.e.m.; P = 0.04) compared with patients with 
normal IL-1RA levels (21 + 13 months) (Extended Data Fig. 9g). Finally, 
we looked at the correlation between tumour- infiltrating CD33* mye- 
loid cells and p16"\“*** senescent cells in a human tissue microarray 
of prostate cancer (n = 92 cases) and found that the majority of tumour 
samples infiltrated by CD33* myeloid cells stained negative for p16'“** 
(Fig. 4f and Extended Data Fig. 10a). This result was also confirmed in a 
panel of single human prostate cancer sections (n = 18), including areas 
of prostatic intraepithelial neoplasia and prostate cancer (Extended Data 
Fig. 10b). Moreover, we found that the majority of the prostatic intrae- 
pithelial neoplasia areas analysed had high p16'\“** staining and low 
CD33 and Ki-67 staining. Conversely, 90% of the prostate cancer areas 
stained negative for p16 \“*“ and positive for CD33 and Ki-67 (Fig. 4g 
and Extended Data Fig. 10c). These findings suggest that myeloid cells 
may promote tumour progression by opposing senescence in cancer in 
humans, in addition to mice. Bioinformatic analysis of data from the Pan- 
Cancer analysis project revealed that patients with low levels of PTEN 
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and high levels of ILIRA and CD33 messenger RNA had a shorter sur- 
vival than the other groups (Extended Data Fig. 10d). 

Taken together, our data allow novel insight into the mechanisms that 
regulate senescence in vivo (Extended Data Fig. 10e). Here we provide, to 
our knowledge, the first evidence that an immune cell subset can antag- 
onize senescence driven by loss of a tumour suppressor gene in vivo, 
demonstrating that senescence evasion by tumour cells can also occur 
in a non-cell-autonomous manner. This finding is of great relevance 
since senescence evasion is commonly ascribed to genetic alterations 
that are unlikely to occur with high frequency in senescent arrested cells 
or in cells committed to senescence’?”””’. 

Our study supports a model whereby Gr-1" myeloid cells protect 
Pten ‘~ tumour cells from senescence by interfering with the balance 
between IL-1o and IL-1RA in the tumour microenvironment (Extended 
Data Fig. 10f). Accordingly, treatments that block the recruitment of 
Gr-1* cells or decrease the levels of IL-1RA can tilt the IL-10 to IL-IRA 
balance, reinforcing senescence in tumours and enhancing the efficacy 
of chemotherapy. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Animals. All mice were maintained under specific-pathogen-free conditions in 
the animal facilities of the IRB Institute, and experiments were performed accord- 
ing to state guidelines and approved by the local ethics committee. Male Pten’“ /~ 
and Pten?**'~ mice were generated and genotyped as previously described*"?. Male 
CByJ.B6-Tg( UBC-GFP)30Scha/J’” transgenic mice that express GFP under the UBC 
promoter were provided by F. Grassi and were genotyped as previously described (at 
IRB Bellinzona)!3. Male Il1ra knockout mice (Illa ‘~) were provided by L. Buhler 
and were genotyped as previously described”°. 

For experiments involving animals, the sample size was chosen taking into con- 

sideration the means of the target values between the experimental group and the 
control group, the standard deviation and the statistical analysis used. For ethical 
reasons, the minimum number of animals necessary to achieve the scientific objec- 
tives was used. Animals were allocated randomly to each treatment group. Different 
treatment groups were processed identically, and animals in different treatment 
groups were exposed to the same environment. 
Human prostate samples. Human tissue microarray (TMA) analyses were carried 
out using previously published data sets*® and commercially available TMAs. Anony- 
mized human tissue samples were obtained from the Cantonal Institute of Pathology 
(Locarno, Switzerland). Informed consent was obtained from all subjects. The study 
was approved by the Canton Ticino Ethics Committee. Three-micrometre sections 
were cut from formalin-fixed paraffin-embedded (FFPE) blocks and mounted on 
positively charged slides. Histological classification was carried out on slides stained 
with haematoxylin and eosin. The histological diagnosis was determined during rou- 
tine pathological assessment. Slides were blindly evaluated by at least two investigators. 
Cells. Primary MEFs were derived from littermate embryos and obtained by cross- 
ing Pten'®*”!"* animals as previously described”*°. Embryos were harvested at 13.5 
days post coitum, and individual MEFs were produced and cultured as previously 
described*'’. At passage 2, the cells were harvested for western blot analysis. In the 
culture experiments, conditioned medium were from either Illra*!* or Illra ‘~ 
cultures. In some experiments, bone marrow precursors were collected from the 
femurs of Pten*'* mice and polarized in vitro in the presence of granulocyte- 
macrophage colony-stimulating factor (GM-CSF) and IL-6 towards the Gr-1 cell 
phenotype. On day 4, the conditioned medium from Gr-1* myeloid cells was col- 
lected, filtered and transferred to Pten '~ MEFs. Cultures were stopped 48 h later, 
and cells were harvested for protein extraction or stained for analysis (see also Ex- 
tended Data Fig. 1m for scheme). Senescence was assessed by means of an SA-B-gal 
assay (Cell Signaling Technology). Gr-1 cells were pre-treated for 24 h with NVP- 
BSK805 (a JAK2 inhibitor). 

For all of the in vitro experiments, the sample size was chosen taking into con- 
sideration the means of the target values between the experimental group and the 
control group, the standard deviation and the statistical analysis used. Most of the 
in vitro experiments were performed in a non-blinded manner. Nevertheless, in 
some cases, the results were collected by an investigator other than the one who 
performed the experiment, to ensure blinded evaluations. 

In vitro differentiation of Gr-1* myeloid cells. Gr-1~ cells were differentiated in 
vitro as previously described’”. Briefly, bone marrow precursors were flushed from 
the long bones of Pten*/* mice or UBC-GFP mice with RPMI 1640 medium. The 
cell pellet was resuspended (1 X 10° cells ml” ') in RPMI 1640 containing 10% heat- 
inactivated FBS, and the cells were cultured in vitro in the presence of 10 ng ml! 
GM-CSF and 40 ng ml" IL-6. On day 4, the cells were harvested and analysed by 
flow cytometry and qPCR (see also Extended Data Fig. 1m for scheme). 

Bone marrow transplantation. Bone marrow was flushed from the long bones of 
male IIra*/* or Illra~‘~ mice under sterile conditions with RPMI 1640 or HBSS 
using a 21-gauge needle. Mononuclear cells were filtered, collected and checked for 
viability using trypan blue. Before transplantation, the bone marrow derived from 
donor mice was depleted of CD3* T cells, NK1.1* NK cells and CD19* B cells by 
magnetic bead separation. Recipient C57BL/6 Pten? <~!~ mice were given 900 cGy 
total-body irradiation (at 7-8 weeks of age), and all mice received an eye inoculum 
comprising 4.0 X 10° bone marrow cells from either Ilra*'* or Il1ra_'~ mice. Bone 
marrow precursors were delivered 2 h after irradiation. All mice (n = 4 per group) 
survived (see also the scheme in Fig. 3a). 

Western blotting, immunohistochemistry and immunofluorescence. Tissue 
and purified epithelial lysates were prepared with RIPA buffer (1X PBS, 1% NP-40, 
0.5% sodium deoxycholate, 0.1% SDS and protease inhibitor cocktail (Roche)). The 
following antibodies were used for western blotting: rabbit polyclonal anti-p16'“““* 
(M156; Santa Cruz Biotechnology), and mouse monoclonal anti-B-actin (AC-74; 
Sigma), anti-PAI1 (H-135; Santa Cruz Biotechnology), anti-IL-1RA (H-110; Santa 
Cruz Biotechnology) and anti-E-cadherin (36/E-cadherin; BD Biosciences). For 
immunohistochemistry (IHC), tissues were fixed in 10% formalin and embedded 
in paraffin in accordance with standard procedures. Sections were stained with anti- 
p53 (Accurate Chemical), anti-p1 6 NK44 (M156; Santa Cruz Biotechnology), anti- 
Ki-67 (clone SP6; Lab Vision), anti-PAI1 (H-135; Santa Cruz Biotechnology) or 


anti-cleaved-caspase-3 (9661, Cell Signaling Technology) antibodies. Immunofluo- 
rescence (IF) on paraffin-embedded sections was conducted with anti-vimentin 
(RV202; Abcam), anti-Ki-67 (clone SP6; Lab Vision) and anti-Ly-6G (Gr-1) anti- 
bodies. Confocal images were obtained with a TCS SP5 confocal microscope (Leica). 
Prostatic epithelial cell purification and cytokine array. Eight-week-old PtenPs*!* 
and Pten?*/~ mice were euthanized, and whole prostates were isolated and pro- 
cessed to single-cell suspensions’* for magnetic-activated cells sorting (MACS). 
Single cells were stained with FITC-anti-CD34 (stroma), FITC-anti-Ter119 (ery- 
throcytes), FITC-anti-CD31 (endothelial cells) and FITC-anti-CD45 (leukocytes) 
antibodies and incubated for 20 min on ice. All antibodies (BD Biosciences) were 
used at 1:300; cells were then loaded onto an MS column (Miltenyi Biotec) for MACS 
separation, and unstained epithelial cells were collected in the negative fraction. 
Purified prostatic epithelial cells were processed as indicated by the manufacturer’s 
instructions in the cytokine array kit (R&D Systems). Developed films were scanned, 
and the obtained images were analysed using ImageJ 1.43u; background signals were 
subtracted from the experimental values. 

Osmotic pump implantation. Micro-osmotic pumps filled with PBS or recom- 
binant IL-1o. (3 pg kg” ') were implanted in the peritoneal cavity of two groups of 
age-matched Pten®*'~ mice, to expose the prostate tissue to a continuous and con- 
trolled concentration of vehicle or protein, respectively. Briefly, mice were anaes- 
thetized, and a midline skin incision was made in the lower abdomen. A pump was 
inserted into the peritoneal cavity; the muscle layer was sutured; and the skin incision 
was closed with wound clips or suturing. 

Autopsy and histopathology. Animals were autopsied, and all tissues were exam- 
ined regardless of their pathological status. Normal and tumour tissue samples were 
fixed in 10% neutral-buffered formalin (Sigma) overnight. Tissues were processed 
by ethanol dehydration and embedded in paraffin according to standard protocols. 
Sections (5 um) were prepared for antibody detection and haematoxylin and eosin 
staining. To evaluate evidence of invasion, sections were cut at 20-|1m intervals 
through the tissue and stained with haematoxylin and eosin. Slides were prepared 
containing three to five of these sections. 

Flow cytometry analysis. For phenotype analysis, the isolated cells were re-suspended 
in PBS containing 1% FCS (Sigma-Aldrich) and were pre-incubated with a purified 
anti-mouse CD16/CD32 antibody (eBioscience) for 30 min at room temperature. 
The cells were then washed and stained for 15 min at room temperature with the 
following anti-mouse monoclonal antibodies: CD45 eFluor 450 (clone 30-F11); 
Gr-1-PE (clone RB6-8C5); CD11b-APC (clone M1/70); F4/80 eFluor780 (clone 
BM8); NK1.1-PE (clone PK13); and CD19-FITC (clone 6D5). All of the antibodies 
were purchased from eBioscience. Samples were acquired on a FACSCanto II flow 
cytometer (BD Biosciences) after fixation with 1% formaldehyde (Sigma-Aldrich). 
When needed, cells were sorted from the prostate single-cell suspension using a 
FACSAria Cell Sorter (BD Biosciences) after staining with specific antibodies for 
30 min at 4 °C in PBS containing 1% FCS. Data were analysed using FlowJo soft- 
ware (Tree Star). 

Treatment of mice with CXCR2 antagonist and docetaxel. For Gr-1* myeloid 
cell depletion, 7-week-old Pten~’~ mice were intraperitoneally injected with a CXCR2 
antagonist (SB265610, 2 mg kg’ in sterile PBS; Tocris) once a day. For the combi- 
natorial treatment, docetaxel was intraperitoneally injected (10 mg kg” ') once a week 
for 3 weeks. Animals were killed at 10 weeks of age, and prostate tissues were harvested. 
Quantitative PCR (qPCR). RNA isolation (QIAGEN) and TaqMan reverse tran- 
scriptase reactions (Applied Biosystems) were performed according to the manu- 
facturer’s instructions. qPCR reactions (SYBR Green system; Bio-Rad) for each sample 
were conducted in triplicate. The primer sequences were obtained from PrimerBank 
(http://pga.mgh.harvard.edu/primerbank/index.html). Each value was normalized 
to the Gapdh level as a reference. The primer sequences used were as follows: Pail 
forward, 5’-TTGAATCCCATAGCTGCTT-3’; Pail reverse, 5’-GACACGCCATA 
GGGAGAGA-3’; p16" forward, 5'-CGCAGGTTCITGGTCACTGT-3’; p16" 
reverse, 5'-TGTTCACGAAAGCCAGAGCG-3’; [16 forward, 5'-TAGTCCTTCC 
TACCCCAATTT-3’; 116 reverse, 5'-TTGGTCCTTAGCCACTCCTTC-3’; Ccl2 
forward, 5'-GTGGGGCGTTAAACTGCAT-3’; Ccl2 reverse, 5’-CAGGTCCCT 
GTCATGCTTCT-3’; Tgfb forward, 5'-CTCCCGTGGCTTCTAGTGC-3’; Tefb 
reverse, 5’-GCCTTAGTTTGGACAGGATCTG-3’; IlIra forward, 5’-CTGCAC 
TTCCACAGTCCAGA-3’; Ira reverse, 5'-CTTAGCCCGCTTCAGCTCTTT-3’; 
Illa forward, 5'-CGAAGACTACAGTTCTGCCAT-3’; Illa reverse, 5'’-ATATG 
TGATGCCCTGGTGGT-3’; Gapdh forward, 5'-AGGTCGGTGTGAACGGAT 
TTG-3’; and Gapdh reverse, 5’-TGTAGACCATGTAGTTGAGGT-3’. 
Genome-wide gene expression analysis. Total RNA was isolated from epithelial 
and myeloid cell populations using an miRNeasy Mini kit (QIAGEN) following the 
manufacturer instructions and was quantified using a NanoDrop ND-1000 Spec- 
trophotometer (NanoDrop Technologies). RNA quality was assessed using an Agilent 
2100 Bioanalyzer (Agilent Technologies). Gene expression profiling was carried out 
using the one-colour labelling method. Labelling, hybridization, washing and slide 
scanning were performed following the manufacturer’s protocols. Briefly, equal 
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amounts of total RNA (100 ng) were amplified, labelled with Cy3 and purified with 
spin columns. Labelled specimens (600 ng) were hybridized to SurePrint G3 Mouse 
GE 8X60K Gene Expression Microarrays (Agilent). After 17 h, slides were washed 
and scanned (G2505C scanner, Agilent Technologies). 

Gene expression data analysis. Images were analysed using Feature Extraction 
software v10.7 (Agilent). Raw data elaboration was carried out with Bioconductor 
(http://www. bioconductor.org), using the R (v3.0.2) statistical environment. Back- 
ground correction was performed with the normexp method with an offset of 50, 
and quantile was used for between-array normalization. The LIMMA (LInear Models 
for Microarray Analysis) package was then used to identify differentially expressed 
genes, using the empirical Bayes method to compute a moderated t-statistic. Gene 
set enrichment analysis (GSEA; v2.07) was performed to examine the association 
between predefined gene sets and gene expression profiles of selected samples. 
Survival curves. Differential survival between patient subgroups was plotted and 
calculated using Kaplan-Meier curves. Patients were stratified based on ILIRA and 
CD33 score values. Briefly, scores were rank ordered and divided into seven percentiles 
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(from the lowest to the highest values). Such stratification showed significant dif- 
ferences in overall survival within The Cancer Genome Atlas Pan-Cancer analysis 
project (log-rank test). The Pan-Cancer data set matrix and clinical information”! 
were downloaded from the UCSC Cancer Genomics Browser (https://genome-cancer. 
ucsc.edu/). 

Statistical analysis. Statistical analysis of the data was performed using a two-tailed, 
unpaired Student’s t-test. Values are expressed as mean + s.e.m. (*P < 0.05; **P < 
0.01; ***P < 0.001). Significant differences in survival curves were calculated using 
the log-rank test. Correlation analysis in TMA staining evaluation was conducted 
with Fisher’s exact test, using the estimated percentage of positively stained cells as 
determined by a pathologist (MLS.). 


30. Alimonti, A. et a/. Subtle variations in Pten dose determine cancer susceptibility. 
Nature Genet. 42, 454-458 (2010). 

31. The Cancer Genome Atlas Research Network. The Cancer Genome 
Atlas Pan-Cancer analysis project. Nature Genet. 45, 1113-1120 
(2013). 
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Extended Data Figure 1 | Gr-1* myeloid cells infiltrate senescent tumours 
in Pten’*~'~ mice. a, Representative SA-f-gal staining of prostate sections 
from 8-week-old Pten’“*'* and Pten’*~/~ mice. Original magnification, x 400. 
b, Representative confocal immunofluorescence (IF) images showing staining 
of the epithelial marker cytokeratin 18 (CK18) (grey) and the senescence 
marker pHP1y (green) in prostate tumours from Pten?*‘~ mice. Cells were 
counterstained with the nuclear marker DAPI (blue). A Pten?*t’* prostate 
negative for pHP1y staining is also shown (inset). c, Representative confocal IF 
image showing proliferating epithelial cells (CK18, grey; Ki-67, red) and 
senescent epithelial cells (cytokeratin 18, grey; pHP1y, green) in Pten?* /~ 
prostate lesions. Cells were counterstained with the nuclear marker DAPI 
(blue). The histogram shows the quantification of CK18* pHP1y" Ki-67-, 
CK18" pHP1y Ki-67* and CK18* pHP1y* Ki-67* cells (n = 3; 1 tumour 
per mouse; 10 fields acquired; 412 cells counted). d, Quantification of 
CD11b*Gr-1* immune cells in 8-week-old Pten?“*'* and Pten’“'~ mice 

(n = 6). e, Flow cytometry analysis showing the heterogeneity of the 
tumour-infiltrating CD45*CD11b*Gr-1* immune cells in Pten?*/~ 
prostates. f, In Pten”*—'~ tumours, the senescence response starts at 8 weeks 
of age (top). A time course experiment is shown, indicating the recruitment 
of Gr-1* myeloid cells in Pten®*'~ and Pten®'~ mice at the onset of 
tumorigenesis (bottom) (n = 3 per group; 1 tumour per mouse). g, Correlation 
between Ki-67 staining and percentage of Gr-1* myeloid cells in the anterior 
(AP) and dorsolateral lobes (DLP) of Pten’*~'~ tumours (n = 3; 1 tumour 
per mouse). h, Experimental scheme. Pten?*~/~ mice were lethally irradiated 
and then transferred with bone marrow from UBC-GFP mice that had been 
depleted of T-, B- and natural killer (NK) cells. Prostate tissues were collected 
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4 weeks after transfer. i, Representative confocal IF images showing the 
localization of myeloid cells (green) infiltrating the anterior prostate gland 

of Pten’*~/~ UBC-GFP mice. Proliferating cells (Ki-67, red) and stroma 
(vimentin, grey) are also shown. Cells were counterstained with DAPI (blue). 
j, Representative confocal IF image showing the localization of tumour- 
infiltrating UBC-GFP cells and proliferating epithelial cells (CK18, grey; 
Ki-67, red) in prostate lesions from Pten?'~ UBC-GFP mice. Cells were 
counterstained with the nuclear marker DAPI (blue). Arrows indicate 

CK18* Ki-67* cells, which were considered for the analysis, while * indicates 
CK18 Ki-67* cells, which were excluded from the analysis. k, Quantification 
of UBC-GEP* Gr-1* cells (n = 4; 1 tumour per mouse; 5 fields acquired; 

300 cells counted). 1, Representative confocal IF image showing the localization 
of tumour-infiltrating myeloid cells (Gr-1, red) and proliferating epithelial cells 
(CK18, grey; Ki-67, green) in prostate lesions from non-irradiated Pten?©/— 
mice. Cells were counterstained with the nuclear marker DAPI (blue). The 
histogram shows the quantification of the distance between tumour-infiltrating 
Gr-1*CK18~ myeloid cells and CK18" Ki-67~ proliferating epithelial cells 

(n = 3; 10 fields acquired; 334 measurements). The arrows indicate CK18" Ki- 
67° cells, which were considered for the analysis. m, Experimental set-up. 

n, Flow cytometry and qRT-PCR analysis of Gr-1* myeloid cells differentiated 
in vitro in the presence of granulocyte-macrophage colony-stimulating 
factor (GM-CSF) and IL-6, showing upregulation of Gr-1 and JJ/10 mRNA. 

o, SA-B-gal staining of. Pten~'~ MEBs. ¢, d, f, g,k,1,n, Error bars, mean + s.e.m. 
P values were derived from an unpaired, two-tailed Student’s t-test 

(**P < 0.01; ***P < 0.001). 
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Extended Data Figure 2 | Gene expression analysis of factors expressed by | FACSAria Cell Sorter. Total RNA was isolated from the epithelial and myeloid 


Gr-1* myeloid cells and epithelial cells sorted from Pten’*‘~ tumours. cell populations, and gene expression profiling was carried out using the 

a, Experimental set-up (top). Protein levels of IL-1RA in Pten’“/~ bulk one-colour labelling method, performing two replicates for each condition. 
prostate tumours and Pten’“ ‘~ immunodepleted prostatic epithelial cells A heatmap displaying the mRNA expression of 53 secreted factors is shown 
(bottom) (n = 3 per group; 1 tumour per mouse). b, Experimental set-up.Gene  (n = 2 per group). c, GaT-PCR analysis of CD11b" Gr-1" F4/80* sorted from 
expression analysis of epithelial cells and Gr-1* myeloid cells purified from Pten“'~ prostate tumours, showing IlIra expression (n = 3). a, c, Error bars, 


Pten’*'~ prostate tumours. Briefly, prostates were isolated from 8-week-old mean + s.e.m. P values were derived from an unpaired, two-tailed Student’s 
Pten?“'~ mice and processed to a single-cell suspension. CD45 epithelial t-test (**P < 0.01; ***P < 0.001). 
cells and CD45*CD11b'Gr-1* myeloid cells were further sorted using a 
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Extended Data Figure 3 | Gr-1* myeloid cells oppose senescence in both 
Pten-loss-induced cellular senescence and oncogene-induced senescence. 


from Gr-1* myeloid cells (n = 3). e, I!Jra mRNA expression of Gr-1~ 
myeloid cells differentiated for 4 days with IL-6 and GM-CSF, in the absence or 
presence of the JAK2 inhibitor NVP-BSK805, compared with bone marrow 


a, Western blot analysis showing the inverse correlation between IL-1RA and 
ple N“4 protein levels in the anterior (AP) and dorsolateral lobes (DLP) of 
Pten?*‘~ prostate tumours. Levels are normalized to E-cadherin expression. 
b, [Ira mRNA expression of bone marrow precursors (BMPs) and Gr-1* 
myeloid cells sorted from Pten’* '~ prostate tumours (n = 3). ¢, Cell growth of 
H-ras MEFs cultured in the presence of conditioned medium from Gr-1* 
myeloid cells (n = 3). d, Quantification (left) and representative images (right) 
of SA-B-gal* H-ras MEFs cultured in the presence of conditioned medium 


precursors (BMPs) (n = 3). f, Quantification (left) and representative images 
(right) of SA-B-gal* Pten-'~ MEFs cultured in the presence of conditioned 
medium from Gr-1* myeloid cells that had been pre-treated with the JAK2 
inhibitor NVP-BSK805 (n = 3). a-f, Error bars, mean + s.e.m. P values 
were derived from an unpaired, two-tailed Student’s t-test (**P < 0.01; 
***D < 0,001). 
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Extended Data Figure 4 | IL-1RA opposes docetaxel-induced senescence in 
LNCaP cancer cells in vitro. a, Histogram showing the quantification of 
pHP1y* cells. Briefly, LNCaP prostate cancer cells were cultured in the absence 
or presence of docetaxel, with or without human recombinant IL-1RA. After 
5 days, cells were collected and stained for immunofluorescence analysis. 
Representative confocal IF staining showing senescent pHP1y~ (red) LNCaP 


cancer cells (inset). Cells were counterstained with the nuclear marker DAPI 
(blue). b, Cell growth of LNCaP cells cultured in the absence or presence of 
docetaxel, with or without human recombinant IL-1RA (n = 3). a, b, Error 
bars, mean + s.e.m. P values were derived from an unpaired, two-tailed 
Student’s t-test (*P < 0.01; ***P < 0.001). 
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Extended Data Figure 5 | Gr-1* myeloid cells infiltrate adenocarcinoma 
areas in lungs from K-rast/©¥ mice. a, Haematoxylin and eosin (H&E), 
ploN*44, 1948", Gr-1, IL-IRA and Ki-67 immunohistochemical staining in 
lungs from K-ras*'“'*" mice. Original magnification, 400. Staining of both 
adenocarcinoma and adenoma areas is shown (left). Histograms showing 
quantification of cells positive for p16™**, p194®", Gr-1, IL-IRA and Ki-67 
(right) (n = 7; 3 sections per mouse; =5 fields per section analysed). 

b, c, Representative confocal IF images showing staining of Ki-67 (green) 


LETTER 


Adenoma 


L 
ts} 


% of p1inkter 
8 


RN B ®@ @ oo 


% of p19ARF* 


i=} 


% of Gr-1* 


% of IL-1ra* 


% of Ki-67* 


Adenoma 


Adenocarcinoma 


DAPW/Gr-1/Ki-67 


Gr-1/Ki-67 


Zoom Gr-1/Ki-67 


and the myeloid marker Gr-1 (red) with (b) or without (c) the epithelial marker 
CK18 (grey), in adenocarcinoma and adenoma areas of lungs from K-ras@7Y 
mice. Cells were counterstained with the nuclear marker DAPI (blue). 

Panel b magnification X 400. Panel c magnification < 200. Top panel insets in 
c show H&E staining (magnification < 100) of the same areas stained for IF; 
bottom panels, magnification < 400. a, Error bars, mean = s.e.m. P values 
were derived from an unpaired, two-tailed Student’s t-test (*P < 0.05; 

**P <0.01; ***P < 0.001). 
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Extended Data Figure 6 | Senescence and apoptotic markers in Pten**/* Tira*'* and Pten’“'~Il1ra~'~ mice. The histograms show 
Pten?“*"* I1ra*'* and Pten’“'~ Il1ra_‘~ mice. a, mRNA levels of Ira quantification of Ki-67 and p16'N™ positivity (n = 4 mice per group; 

in BMPs from the indicated genotypes (n = 3). b, Quantification of Ki-67 1 tumour per mouse; 3 sections per tumour; 3 fields per section). 

staining. c, d, pl6'““*“ mRNA and protein levels. e, PAI1 mRNA and protein —_h, Immunohistochemistry for cleaved caspase-3 in prostate tissues from 
levels. f, SA-B-gal staining in prostate tissues from Pten?**'* Tlira*'* and Pten?**'* Tl1ra*'* and Pten?/“Illra_'~ mice. a-e, g, h, Error bars, 
Pten?“'~TlIra ‘~ mice (n = 4 mice per group; 1 tumour per mouse; mean + s.e.m. P values were derived from an unpaired, two-tailed Student’s 
3 sections per tumour; =5 fields per section). g, Ki-67 and ple t-test (*P < 0.05; **P < 0.01; ***P< 0.001). 

immunohistochemical staining of stage-matched prostate tumours from 
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Extended Data Figure 7 | Efficiency of magnetic-activated cell sorting 
(MACS) purification and bone marrow transplantation. a, Representative 
flow cytometry plots showing whole bone marrow cells, phycoerythrin (PE)- 
positive cells isolated after magnetic separation and bone marrow cells depleted 
of T, NK and B cells before adoptive transfer to irradiated Pten’*'~ mice 
(gating on total cells). Briefly, cells were flushed from the long bones of donor 
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mice and stained with the following anti-mouse antibodies: anti-CD3-PE, 
anti-NK1.1-PE and anti-CD19-PE. The cells were then washed and stained 
with anti-PE magnetic beads and collected for magnetic separation. 

b, Representative plots obtained from flow cytometry analysis of splenocytes 
isolated from Pten’“ ‘~ mice before and after lethal irradiation (top). Immune 
reconstitution 14 days after bone marrow transplantation (bottom). 
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Extended Data Figure 8 | Effect of IL1-a and CXCR2a on Pten?*~/~ 
tumours. a, Experimental scheme. Briefly, osmotic pumps were implanted 
in the peritoneal cavity of six Pten’“'~ mice, to expose the prostate tissue 

to a continuous and controlled concentration of either IL-1o or PBS. 

b, Immunohistochemical staining (H&E, Ki-67 and plo) of prostate 
sections from Pten?°~'~ mice treated with IL-1o or PBS. c-e, Histograms 
showing quantification of glands affected by prostatic intraepithelial neoplasia 
(PIN) (c), Ki-67 positivity (d) and plo NS“ positivity (e) (n = 3 per group; 

1 tumour per mouse; 3 sections per mouse; 3 fields per sections were analysed). 
f, Protein profile of immunodepleted epithelial cells showing the high levels of 
cytokines that recruit (CXCL1 and CXCL2) and activate (GM-CSF and IL-6) 
Gr-1* myeloid cells in Pten’“-'~ prostate tumours. g, Experimental set-up. 
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Doce, docetaxel. h, Flow cytometry plots showing the reduced recruitment of 
Gr-1* myeloid cells in Pten®“~'~ mice after treatment with a CXCR2 
antagonist (CXCR2a), with gating on live CD45* cells. The histogram shows 
the frequency of Gr-1* myeloid cells (n = 5 control group; n = 7 treated 
groups). i, j, Flow cytometry plots showing the recruitment of Gr-1* myeloid 
cells to the peripheral lymph nodes (upper panels) and spleen (lower panel) 
isolated from Pten?°'~ mice, after treatment with CXCR2a, with gating on 
live cells. The histograms (right) show the frequency of Gr-1* myeloid cells 
in the lymph nodes and spleen from Pten’“/~ mice, after treatment with 
CXCR2a (n = 5 mice per group). c-f, h-j, Error bars, mean + s.e.m. P values 
were derived from an unpaired, two-tailed Student’s t-test (*P < 0.05; 

**P < 0.01; ***P < 0.001). 
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Extended Data Figure 9 | Treatment with a CXCR2 antagonist in vivo. 

a, Il1ra mRNA levels in Pten’* ‘~ tumours after treatment with CXCR2a alone 
or in combination with docetaxel. b, Immunohistochemical staining (H&E and 
SA-B-gal) in mice treated with CXCR2a and docetaxel. c, Quantification of 
cleaved caspase-3 in Pten?*'~ tumours after the indicated treatments. 

d, Immunohistochemical staining for Ki-67 in stage-matched prostate tumours 
from Pten?*~'~ mice after treatment. The histograms show quantification 
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CXCR2a (n = 3). NS, not significant. g, Staining and quantification of IL-1RA 
in primary tumours from patients. Responder patients (“R”) and non- 
responder patients (“NR”), based on disease-free survival. a, c-e, Error bars, 
mean + s.e.m. P values were derived from an unpaired, two-tailed Student’s 
t-test (*P < 0.05; **P < 0.01). b-d, Control n = 5; treated n = 7; 3 sections per 
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Extended Data Figure 10 | Evidence in human samples and proposed 
model. a, b, Graphs showing the inverse association between p16'\“** and 
CD33 in the tissue microarrays and single prostate sections from human 
prostate cancer. Box plots in a show the interquartile range, whiskers show the 
full range. c, Histogram showing the percentage of cases positive for 

Ki-67, plo ““*“ and CD33 in sections. Normal-like prostate areas were 
compared with PIN and prostate cancer (PCA) areas in the same section. 

d, Kaplan-Meier analysis (see the Survival curves subsection in Methods). 


W IL-1 receptor 


e, Gr-1* myeloid cells recruited to the tumour site oppose Pten-loss-induced 
cellular senescence by secreting IL-1RA in the tumour microenvironment. 

f, Gr-1* myeloid cells can protect tumour cells from senescence by tilting 
the balance between IL-1 and IL-1RA in the tumour microenvironment. 
Pharmacological interventions aimed at impairing Gr-1* myeloid cell 
recruitment (for example, CXCR2a) can enhance senescence, thus improving 
chemotherapy efficacy. CIS, chemotherapy induced senescence. a, Correlation 
assessed with Fisher’s exact test. 
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Broad and potent HIV-1 neutralization by a human 
antibody that binds the gp41-gp120 interface 


Jinghe Huang’, Byong H. Kang", Marie Pancera’, Jeong Hyun Lee*4, Tommy Tong”, Yu Feng*, Hiromi Imamichi’, 

Ivelin S. Georgiev’, Gwo-Yu Chuang”, Aliaksandr Druz?, Nicole A. Doria-Rose?, Leo Laub!, Kwinten Sliepen®, Marit J. van Gils®, 
Alba Torrents de la Pefia®, Ronald Derking®, Per-Johan Klasse’, Stephen A. Migueles', Robert T. Bailer?, Munir Alam§, 

Pavel Pugach’, Barton F. Haynes’, Richard T. Wyatt?*, Rogier W. Sanders®”, James M. Binley”, Andrew B. Ward?*, 


John R. Mascola”, Peter D. Kwong” & Mark Connors! 


The isolation of human monoclonal antibodies is providing import- 
ant insights into the specificities that underlie broad neutralization 
of HIV-1 (reviewed in ref. 1). Here we report a broad and extremely 
potent HIV-specific monoclonal antibody, termed 35022, which binds 
a novel HIV-1 envelope glycoprotein (Env) epitope. 35022 neutra- 
lized 62% of 181 pseudoviruses with a half-maximum inhibitory con- 
centration (IC59) <50 pg ml~'. The median ICs of neutralized viruses 
was 0.033 pg ml” ', among the most potent thus far described. 35022 
did not bind monomeric forms of Env tested, but did bind the trimeric 
BG505 SOSIP.664. Mutagenesis and a reconstruction by negative- 
stain electron microscopy of the Fab in complex with trimer revealed 
that it bound to a conserved epitope, which stretched across gp120 
and gp41. The specificity of 35022 represents a novel site of vulner- 
ability on HIV Env, which serum analysis indicates to be commonly 
elicited by natural infection. Binding to this new site of vulnerabil- 
ity may thus be an important complement to current monoclonal- 
antibody-based approaches to immunotherapies, prophylaxis and 
vaccine design. 

Induction of a potent neutralizing antibody response capable of rec- 
ognizing highly diverse isolates of HIV-1 is one of the most important 
goals of HIV vaccine research. This represents a considerable challenge 
given the extraordinary antigenic variability of the Env surface glycopro- 
tein. However, approximately 20% of the HIV-infected population does 
develop a humoral immune response capable of recognizing highly di- 
verse strains’ °. In the past several years improved patient cohorts” ®, 
HIV-specific B-cell isolation’’, and IgG cloning techniques'®"' have 
permitted extraordinary progress in the isolation of broadly neutral- 
izing monoclonal antibodies (bNabs) from these individuals. Thus far, 
these primarily fall into four categories based on the position of their 
epitopes on the Env protein, a trimer of gp120 and gp41 heterodimers 
that is the target of neutralizing antibodies. These sites include the CD4- 
binding site on gp120 (refs 8, 12) (of which VRCO1 is an example), the 
glycan-containing regions of V1V2 on gp120 (of which PG9 and PG16 
are examples), the V3 region centred on the N332 glycan of gp120 (refs 
7, 13) (of which PGT121 is an example) and the membrane-proximal 
external region (MPER) on gp41 (of which 10E8 is an example)'*”*. It 
remains unclear to what extent these four categories represent the prev- 
alent and immunodominant sites of Env vulnerability through which 
broad neutralizing responses are mediated, or whether additional spec- 
ificities exist'®’. 

Here we report the isolation of a broad and potently neutralizing HIV- 
specific monoclonal antibody, 35022, that binds a novel epitope. The 
neutralizing activity of 35022 is highly complementary to the activities 
of other known bNabs. We used mutagenesis, crystallography and electron 


microscopy to define the Env site targeted by 35022. Our results indi- 
cate that 35022 neutralization occurs by a novel mode of trimer recog- 
nition along a conserved face on contiguous areas of gp41 and gp120. 

For a better understanding of the specificities that underlie broadly 
neutralizing antibody responses we applied a technique to identify human 
monoclonal antibodies of interest from peripheral blood B cells without 
previous knowledge of the target specificity’. IgG* B cells of a donor 
(N152) with broad and potent neutralizing serum and from whom re- 
cently described 10E8 antibody was cloned” were sorted and expanded. 
The supernatants of B-cell microcultures were screened for neutralizing 
activity and IgG genes from positive wells were cloned and re-expressed. 
In addition to the 10E8 antibody, eight clonal family variants of an addi- 
tional antibody with neutralization activity were found, among which 
the 35022 antibody was the most potent and broad (Supplementary 
Table 1a, b). This antibody was derived from IGHV- 1-18*02 and IGLV- 
2-14*02 germline genes, and was highly somatically mutated in variable 
genes of both heavy chain (35%) and A light chain (24%) compared to germ 
line. The 35022 antibody possessed a heavy-chain complementarity- 
determining 3 region (CDR H3) composed of 14 amino acids (Fig. la 
and Supplementary Table 2) and an insertion of 8 amino acids in frame- 
work 3 (FR3). High levels of somatic mutation and FR3 insertions are 
features of other HIV-specific bNabs’**"*7!”, Autoreactivity or poly- 
reactivity are properties of several HIV-specific antibodies”*** that could 
limit their use in therapies or prophylaxis. However, 35022 bound HEP-2 
epithelial cells only modestly (Extended Data Fig. 1a) and did not bind 
a panel of autoantigens (Extended Data Fig. 1b, c). Against a large panel 
of pseudoviruses, 35022 neutralized 62% of 181 isolates with an ICs 
<50 ug ml ' (Fig. 1b and Supplementary Table 3). In numerous cases 
where the ICs of 10E8 was >1 Lg ml ~ 1 that of 35022 was 100 to 1,000- 
fold lower (Supplementary Tables 1b and 3), indicating that their activ- 
ities were highly complementary. It is likely that 35022-like antibodies 
account for much of the breadth and potency of the N152 patient serum 
against clades A and B (Supplementary Table 3), whereas 10E8-like anti- 
bodies may account for much of the breadth against clade C isolates. 
Overall, the median IC;9 of 35022 for sensitive viruses was 0.033 Lig ml 2 
which is among the most potent thus far described (Fig. 1b). 

The neutralizing spectrum of 35022 was then compared to those of 
other bNabs. The ICs9 of 35022 against a panel of diverse isolates did 
not correlate with those of the bNabs VRCO1, 10E8, PG9 and PGT121 
(Extended Data Fig. 2a). In addition, a neutralization-based clustering 
analysis revealed that 35022 clustered separately from other bNabs (Ex- 
tended Data Fig. 2b). Furthermore, 35022 did not compete with other 
known bNabs when bound to virus-like particles (VLPs) (Extended Data 
Fig. 2c). Its neutralization of many pseudoviruses did not exceed 80% 


1 Laboratory of Immunoregulation, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA. “Vaccine Research Center, National Institute of Allergy 
and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA. *The Scripps Center for HIV/AIDS Vaccine Immunology and Immunogen Discovery, The Scripps Research Institute, 
La Jolla, California 92037, USA. “International AIDS Vaccine Initiative (IAVI) Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, California 92037, USA. °San Diego Biomedical Research 
Institute, San Diego, California 92121, USA. °Department of Medical Microbiology, Academic Medical Center, University of Amsterdam, Amsterdam 1100 DD, The Netherlands. 7Department of 
Microbiology and Immunology, Weill Medical College of Cornell University, New York, New York 10065, USA. 8Duke Human Vaccine Institute, Duke University, Durham, North Carolina 27710, USA. 


138 | NATURE | VOL 515 | 6 NOVEMBER 2014 


©2014 Macmillan Publishers Limited. All rights reserved 


@ = 35022 heavy chain 


IGHvt IGHD IGHJ CDR3 length VH mutation 
(amino acid) frequency (nt) 
1-18*02 5-24*01 4°02 or 4*03 16 96/276 (35%) 
35022 light chain 
IGLV IGLJ CDR3 length VL mutation 
(amino acid) frequency (nt) 


2-14*02, or 2-23*01, 


or 2-23*02, or 2-23°03 me) no 


71/288 (24%) 


tGermline alleles were determined using the IMGT database (http://imgt.org) 


PGT121 VRCO1 


b 35022 


— IC, < 1 ug mt — IC55 1-50 ug mt — IC5q > 50 ug mr 
NIH4 3BNC PGT PGT 
35022 10E8 4E10 VRCO1 5-46 117 PGQ PG16 121 128 
No. of viruses 181 180 180 177 180 180 177 177 177 177 


IC5o <50 Lg mit 62% 98% 98% 89% 85% 84% 78% 73% 64% 63% 
49% 72% 37% 75% 76% 77% 65% 59% 51% 50% 
Geometric mean IC,,* 0.056 0.22 1.3 


Median IC., 0.033 0.35 1.94 


ICgq <1 wg mit 


0.250 0.138 0.09 0.11 0.06 0.05 0.07 


0.248 0141 0.07 0.09 0.02 0.022 


*Concentration is 1g mI" 


Figure 1 | Analyses of 35022 sequence and neutralization. a, Inferred 
germline genes encoding the variable regions of 35022. b, Neutralizing activity 
of antibodies against a 181-isolate Env-pseudovirus panel. Dendrograms 
indicate the gp160 protein distance of HIV-1 primary isolate Env glycoproteins. 
Data below the dendrogram show the number of tested viruses, the percentage 
of viruses neutralized and the geometric mean or median ICs» for viruses 
neutralized with an ICs) < 50 pg ml '. 


even at high concentrations (Extended Data Fig. 3a) and its potency in- 
creased when pseudoviruses were produced in the presence of the gly- 
cosidase inhibitors NB-DNJ or kifunensine, consistent with recognition 
of high mannose (Extended Data Fig. 3b)”°. However, neutralization was 
unaffected by mutation of N-linked glycosylation sites critical for bind- 
ing of known bNabs (Extended Data Fig. 4a—c)”"’. Taken together, these 
data suggested that 35022 binds glycans, but its specificity differed from 
all previously characterized bNabs. 

Mutation of four predicted sites of N-linked glycosylation on HIVjgcsp 
(HIV-1/clade B) Env diminished neutralization potency—N88A, N230A, 
N241A and N625A (Fig. 2a and Supplementary Table 4). This result 
suggested that 35022 recognized elements of both gp120 and gp4l, a 
property that may be consistent with several recently isolated antibodies'*”. 
When mutations were introduced in the five residues on either side of 
these four sites, the V89A, T90A, K227A, T232A and $243A mutations 
each diminished neutralization (Supplementary Table 4). With the ex- 
ception of V89A and K227A, it is likely that the impact of each of these 
mutations was to disrupt the Asn-X-Ser/Thr glycotransferase sequon. 
The T627A mutation had no impact, suggesting that a glycan may not 
be present at 625 and 35022 may make a protein contact at this posi- 
tion. Overall, similar results were obtained using replication-competent 
HIV, a1 (HIV-1/clade B) viruses (Supplementary Table 5). Examination 
of the sequences of resistant pseudoviruses within clade C did not reveal 
a clear pattern of variation at each of the positions found to affect 35022 
neutralization or within the glycosylation sequon. It is therefore pos- 
sible that the resistance of clade C viruses is mediated by other factors 
such as variations in glycosylation pattern or conformation. 

35022 did not bind to a panel of soluble recombinant Env proteins (Ex- 
tended Data Fig. 5a—c), suggesting that these do not have the appropriate 
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conformation or glycosylation for binding. However, the 35022 anti- 
body did bind a recently described stabilized, cleaved, soluble trimer, 
BG505 SOSIP.664 (Fig. 2b)”°. Despite a plateau in neutralization below 
50% (Extended Data Fig. 3a) and lacking glycans at positions 230 and 
241, binding to BG505 SOSIP.664 trimer had numerous characteris- 
tics consistent with its activity against the HIVjrcsp pseudovirus. 35022 
binding was increased to trimer produced in cells treated with kifunen- 
sine or cells deficient in glycan processing (Fig. 2b), and diminished by 
mutations at positions 88 and 625 (Fig. 2c). 35022 did bind to BG505 
SOSIP trimer lacking the furin cleavage site (BG505 SOSIP.SEKS). In 
surface plasmon resonance (SPR) experiments, 35022 also bound to 
immobilized BG505 SOSIP.664 with high affinity (dissociation constant 
(Ka) = 5.6 nM) (Fig. 2d). Binding was markedly lower to the gp120- 
gp4lecro protomer and no binding was detected to the gp120 mono- 
mer (Fig. 2e). 35022 bound the uncleaved BG505 SOSIP.SEKS but no 
binding was observed to the uncleaved form lacking the SOSIP muta- 
tions (Fig. 2f). These observations, combined with the lack of binding 
of 35022 to all other soluble forms of Env tested, suggested that this 
antibody requires a trimeric structure for binding its epitope on gp120 
and gp4l (ref. 27). 

To provide an atomic-level understanding of the structure of the 
35022 antibody, we crystallized the Fab of 35022. Crystals were obtained 
that diffracted to 1.55 A resolution (Supplementary Table 6). Overall the 
structure of 35022 Fab revealed a relatively flat antigen-combining site, 
lacking long protruding loops, and flanked by the complementarity- 
determining region 1 on the light chain (CDR L1) and the 8-amino- 
acid insertion in FR3 of the heavy chain (Fig. 3a). The surface of the 
antigen-combining site was heavily altered by somatic mutation, and 
two pairs of cysteines introduced by somatic mutation in CDR L1 and 
L3 formed disulphide bonds (Fig. 3a and Extended Data Fig. 6a). 

We next sought to determine the structure of the antibody-antigen 
complex. The ability of 35022 to bind the BG505 SOSIP.664 trimer per- 
mitted imaging of the antibody-antigen complex by negative stain elec- 
tron microscopy (EM). The reconstruction of these images showed that 
three 35022 Fabs bound to the trimers at sites close to the predicted 
viral membrane (Fig. 3b and Extended Data Fig. 6b). Superposition of 
the negative stain reconstruction of the soluble BG505 SOSIP.664 with 
35022 Fab onto the BaL EM tomogram of the viral spike (Extended 
Data Fig. 6b) suggested that the viral membrane is in close contact to 
the 35022 Fab light chain. Residues Tyr 68 and Trp 69 in the light chain 
and FR3 tyrosines at residues 65 and 72 form potential surfaces of mem- 
brane association. The 35022 heavy chain was in close proximity to the 
four sites observed to contribute to the 35022 epitope in mutagenesis 
experiments (N88, N230, N241 and N625). The CDR H3 was predicted 
to interact with N625 and CDR H2 with N88. The 8-amino-acid inser- 
tion in the framework 3 heavy chain is located close to residues 88-90 
on gp120. Reversion of this insertion to germline markedly diminished 
neutralization against most pseudoviruses in our panel (Supplementary 
Table 7). 35022 binds a surface on the Env spike that is distinct from 
two other antibodies, 83ANC195 and PGT151, reported to bind gp120 
and gp41 (Extended Data Fig. 7)'*"°. 

Analysis of the 35022 site of vulnerability (Extended Data Fig. 8a—c) 
indicated that it is highly conserved. The glycans predicted at positions 
88, 241 and 625 were found to be among the most highly conserved N- 
linked glycosylation sequons of 4,265 HIV-1 sequences in the Los Alamos 
database (Supplementary Table 8). Despite this high level of conserva- 
tion, analysis of the Env gene of the patient’s plasma virus showed that 
the predicted amino acid sequence varied at the critical 35022 contacts. 
In addition to the previously published 10E8 escape mutations W680R 
and K683Q, an N230Q is predicted in one sequence, N241D in half of the 
sequences, and an N624D and N625Q in all sequences (Extended Data 
Fig. 9a). When these mutations were introduced into HIVjrcsr pseu- 
doviruses, there was a drop in neutralization with the greatest effect caused 
by the N625Q mutation found in all of the plasma sequences (Extended 
Data Fig. 9b). These data suggest that the autologous virus has escaped 
neutralization by 35022. 
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To gain insight into the prevalence of the specificity of 35022, we 
added the 35022 neutralization fingerprint to the ten that we had pre- 
viously identified (Extended Data Fig. 8d, e)**. Notably, 13 of the sera 
(38%) showed significant 35022 neutralization signals (>0.2). This level 
of prevalence was substantially higher than for the V1V2-directed res- 
ponse (typified by the PG9 antibody) or that of the 8ANC195 antibody. 
However, it was lower than the prevalence of responses to the MPER 
(50% prevalence), the CD4-binding site (53% prevalence), or the V3 gly- 
can site (82% prevalence). The neutralizing activity of sera was also mea- 
sured against HIVjxcsp pseudoviruses bearing N88A, N230A, N241A 
or N625A mutations (Extended Data Table 1). These mutations caused 
a greater than fivefold increase in IDs (50% inhibitory dilution) in more 
than half of donors, with a high level of concordance between the impact 
of each of these mutations within a given serum. These results suggested 
that 35022 is unlikely to be the product ofa unique B-cell repertoire or 
infecting virus, but rather arises commonly among patients that develop 
HIV-specific neutralizing antibodies. 

To achieve a better understanding of the mechanism of 35022’s ac- 
tivity, we examined the timing of its binding and neutralization during 
virus fusion. Given the proximity of the 35022 epitope to the mem- 
brane, it was important to perform these experiments in the context of 
Env expressed on cells or virions. In these settings, MPER-specific anti- 
bodies have limited access to the native trimer and bind best after the 
conformational changes induced by CD4 attachment”. Binding of 35022 
to Env expressed on the cell surface was low and similar to the MPER 
antibody 2F5 (Fig. 4a). 35022 binding to VLPs was weak compared to 
that of VRCO1 but similar to that of 10E8 (Fig. 4b). 35022 binding was 
slightly inhibited by soluble CD4 (sCD4) binding, suggesting the 35022 
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infecting cells. d, Kinetic assay of HIVjgy,, neutralization. See Methods for a 
description of individual formats. e, Schematic of a potential conformational 
change resulting in raising of the trimer spike required to permit access of 
35022 to its epitope. Data is representative of at least 3 experiments. Error bars 
denote one standard error of the mean. 


epitope is not induced by sCD4 under these experimental conditions. 
35022 neutralization was partially eliminated by washing of pseudo- 
virions before infection, a result consistent with limited access to Env 
on free virions (Fig. 4c)”. However, if 35022 was incubated with VLPs, 
permitted to bind to target cells, and then after 2 h the cells were washed, 
there was little impact on neutralization compared with the leave in 
format, similar to all other antibodies except the MPER 10E8 antibody 
(Fig. 4d). Similar to 2G12 and 10E8, 35022 activity was relatively high 
in the post-CD4 format (see Methods), consistent with previous work 
showing that virus-sCD4 complexes are more sensitive to neutraliza- 
tion than virus alone*®. In a post-CD4/CCRS assay, only 10E8 neutralized 
virus, consistent with previous observations (Fig. 4d)°°. Taken together 
with the structural data, these results suggest that in the context ofa lipid 
membrane, 35022 binds Env poorly before CD4 attachment. However, 
after trimer attachment to CD4, 35022 may bind to an early intermediate 
that exposes the 35022 epitope possibly by raising the Env spike within 
the viral membrane (Fig. 4e). 

Our findings concerning the 35022 antibody and its specificity have 
a number of implications for the use of antibodies in HIV therapy, pro- 
phylaxis and efforts to stimulate HIV-specific antibodies with vaccines. 
Its novel binding site and spectrum of activity against HIV strains sug- 
gest that it could complement other antibodies used in passive immu- 
notherapy or prophylaxis. In addition, the antibody is extremely potent, 
indicating that its activity in vivo may therefore persist even at low con- 
centrations. Perhaps most importantly, the novel epitope bound by the 
35022 antibody represents a new site of vulnerability that could poten- 
tially be targeted by HIV vaccines. The high prevalence of 35022-like 
neutralizing activity in HIV-infected cohorts increases the likelihood 
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that production of similar antibodies could be induced by vaccination. 
In addition, the highly specific recognition by 35022 of BG505 SOSIP. 
664 suggests that this soluble, cleaved trimer antigenically resembles 
the native Env trimer at the gp120-gp41 interface. Given the binding 
characteristics of 35022, these results underscore the possibility that 
immunogens structurally similar to the native trimer are required for 
elicitation of such antibodies”’. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Study patients. We selected the plasma and peripheral blood mononuclear cells 
(PBMCs) from the HIV-1-infected patients enrolled in the National Institute of 
Health under a clinical protocol (ClinicalTrials.gov; https://clinicaltrials.gov/ct2/ 
show/NCT00029445?term=NCT00029445&rank=1 identifier NCT00029445) 
approved by the Investigational Review Board in the National Institute of Allergy 
and Infectious Diseases (NIAID-IRB). All participants signed informed consent 
approved by the NIAID-IRB. The criteria for enrolment were as follows: having a 
detectable viral load, a stable CD4 T-cell count above 400 cells pl ', being diag- 
nosed with HIV infection for at least 4 years, and off antiretroviral treatment for at 
least 5 years. Donor N152 was selected for B-cell sorting and antibody generation 
because his serum neutralizing activity is among the most potent and broad in our 
cohort. At the time of leukapheresis, he had been infected with HIV-1 for 20 years, 
with CD4 T-cell counts of 325 cells pl = plasma HIV-1 RNA values of 3,811 copies ml~ : 
and was not on antiretroviral treatment. 

Memory B-cell staining, sorting and antibody cloning. Staining and single-cell 
sorting of memory B cells were performed following a detailed protocol recently 
published®. Briefly, a total of 140,000 CD19‘ IgA" IgD- IgM memory B cells were 
sorted and re-suspended in medium with IL-2, IL-21 and irradiated 3T3-msCD40L 
feeder cells, and seeded into 384-well microtitre plates at a density of 4 cells per well. 
After 13 days of incubation, supernatants from each well were screened for neutrali- 
zation activity using a high-throughput micro-neutralization assay against HIV-lyn.3 
and HIV-1ga..26. From the wells that scored positive in both the HIV-1yn3 and 
HIV-1gar26 neutralization assay the variable region of the heavy chain and the 
light chain of the immunoglobulin gene were amplified by RT-PCR and re-expressed 
as described previously'®. The full-length IgG was purified using a recombinant 
protein-A column (GE Healthcare). 

Generation of pseudoviruses and VLPs. HIV-1 Env pseudoviruses were gener- 
ated by co-transfection of 293T cells with pSG3 delta Env backbone and a second 
plasmid that expressed HIV-1 Env at a ratio of 2:1. HIV, a1 viruses were generated 
by transfection of a single plasmid containing the entire viral genome. 72h after 
transfection, supernatants containing pseudoviruses were harvested and frozen at 
—80 °C until further use. Glycosidase inhibitors were added to the cells at the time 
of transfection at flowing concentration, 25 [tM kifunensine, 500 1M NB-DNJ and 
20 UM swainsonine. JRCSF mutants were produced by altering the JRCSF Env plas- 
mid using QuikChange Lightning mutagenesis according to the manufacturer’s 
protocol (Agilent). VLPs for ELISA assays were produced by transient transfection 
of 293T cells with a pCAGGS Env-expressing plasmid and the subgenomic plasmid 
pNL4-3.Luc.R-E- as previously described’’. The N152 patient autologous virus was 
sequenced as previously described”. 

Neutralization assays. Neutralization activity of monoclonal antibodies or serum 
was measured using single-round HIV-1 Env-pseudovirus infection of TZM-bl cells 
as described previously’. Heat-inactivated patient serum or monoclonal antibody 
was serially diluted fivefold with Dulbecco’s modified Eagle medium-10% FBS 
(Gibco), and 10 tl was incubated with 40 il of pseudovirus in a 96-well plate at 
37 °C for 30 min. TZM-bI cells were then added and plates were incubated for 48 h. 
Assays were then developed with a luciferase assay system (Promega), and the 
relative light units (RLU) were read on a luminometer (Perkin Elmer). Washing of 
pseudovirions to determine access by antibody to the trimer on free virus was per- 
formed as described previously*’. Neutralization in various formats to determine the 
timing and mechanism of neutralization were performed as described previously’. 
Briefly, in the standard ‘leave in’ format monoclonal antibodies were mixed with 
VLPs 1h before infection of CF2.CD4.CCRS cells with no subsequent washing. 
Alternatively, in a ‘cell wash’ format, virus and antibody were incubated with tar- 
get cells for 2 h, followed by a wash to remove any unattached virus. In the post-CD4 
format, VLPs were premixed with 3 1g ml~ | of sCD4 for 15 min, then incubated 
with antibody for 1 h before adding to cells that lack CD4 but express CCR5 (CF2. 
syn.CCR5 cells). In the post-CD4/CCRS5 format, SOS VLPs were permitted to attach 
to cells for 2h, unbound VLPs were washed away and graded concentrations of 
antibodies were added before infection was activated with DTT. 

Prediction of prevalence of 35022-like antibodies in patient serum. An anti- 
body neutralization fingerprint, the pattern with which an antibody neutralizes a 
panel of diverse viral strains, was used to delineate the structural epitope recognized 
by that antibody***’. For each serum, neutralization on a panel of 21 HIV-1 strains 
was represented as a combination of the neutralization fingerprints for a reference 
set of antibody specificities targeting the other four major sites of Env vulnerability 
as well as the 35022 fingerprint, to obtain an estimate of the relative contribution 
of each of these specificities to neutralization by the given serum. The neutraliza- 
tion behaviour of sera was deconvoluted from a cohort of 34 donors, each with 
neutralization breadths of greater than 50%. 

Binding assays. HIVyu2 gp160 extracellular domain (gp140) foldon trimer, gp120 
and gp41 monomers were produced as described previously*®. HIVgar (clade B), 
HIVema3s (CRFO1_AE), HIVensa (clade C), HIVo6zmes1 (clade C), HIVo3111975 
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(clade E) gp120 proteins, and the HIVyg3z (clade A), HIVensa (clade C), HIVuga1 
(clade D) and HIVgp39 (clade F) gp140 monomers were obtained through the NIH 
AIDS Research and Reagent Program. For ELISA assays each antigen (2 jig ml") 
was coated on 96-well plates overnight at 4 °C. Plates were blocked with BLOTTO 
buffer (PBS, 1% FBS, 5% non-fat milk) for 1h at room temperature, followed by 
incubation with antibody serially diluted in disruption buffer (PBS, 5% FBS, 2% 
BSA, 1% Tween-20) for 1h at room temperature. 1:10,000 dilution of horseradish 
peroxidase (HRP)-conjugated goat anti-human IgG antibody was added for 1 h at 
room temperature. Plates were washed between each step with 0.2% Tween 20 in 
PBS. Plates were developed using 3,3’,5,5’-tetramethylbenzidine (TMB) (Sigma) 
and read at 450 nm. ELISA assays using the BG505 SOSIP.664 trimer and mutants 
were performed as described previously**”’. Surface plasmon resonance experiments 
were performed at 25 °C ona Biacore 3000 instrument (GE Healthcare) using D7324 
or anti-histidine capture as previously described’. For the SPR experiment presen- 
ted in Fig. 2d an anti-histidine capture was used (amount of immobilized ligand 
(R,) = ~500 RU) and in Fig. 2e, fa D7324 capture was used (Ry, = ~150 RU for 
trimer and protomer; 130 RU for gp120 in Fig. 2e and R; = ~500 RU in Fig. 2f). 
ELISA assays using VLPs were performed as previously described*”*'. 
Autoreactivity assays. Reactivity to HIV-1 negative human epithelial (HEP-2) cells 
was determined by indirect immunofluorescence on slides using Evans Blue as a 
counterstain and FITC-conjugated goat anti-human IgG (Zeus Scientific)”. Slides 
were photographed on a Nikon Optiphot fluorescence microscope. Kodachrome 
slides were taken of each monoclonal antibody binding to HEP-2 cells at a 10-s 
exposure, and the slides scanned into digital format. The Luminex AtheNA Multi- 
Lyte ANA test (Wampole Laboratories) was used to test for monoclonal antibody 
reactivity to SSA/Ro, SS-B/La, Sm, ribonucleoprotein (RNP), Jo-1, double-stranded 
DNA, centromere B, and histone and was performed as per the manufacturer’s 
specifications and as previously described”. Monoclonal antibody concentrations 
assayed were 50 jig ml’. 10 til of each concentration were incubated with the lumi- 
nex fluorescent beads and the test performed per the manufacturer’s specifications. 
Structure determination and analysis. The antigen-binding fragment of 35022 
(Fab) was prepared using HRV3C digestion, as previously described**. HRV3C was 
introduced in the hinge region of the heavy chain plasmid DNA. Both light and heavy 
chain plasmids were co-transfected in 293F as described previously. The antibody, 
35022 with HRV3C IgG was purified over protein A, cleaved with HRV3C and the 
flow-through collected and run onto a size-exclusion chromatography (S200). Puri- 
fied 35022 Fab set up for 20 °C vapour diffusion sitting-drop crystallizations on the 
Honeybee 963 robot. A total of 576 initial conditions adapted from the commer- 
cially available Hampton (Hampton Research), Precipitant Synergy (Emerald Bio- 
systems) and Wizard (Emerald Biosystems) crystallization screens were set up and 
imaged using the Rockimager (Formulatrix), followed by hand optimization of crys- 
tal hits. Crystals were grown in 15% isopropanol, 25% PEG 3350, 0.2 M ammonium 
citrate pH 4.5 diffracted to 1.55 A resolution ina cryoprotectant composed of moth- 
er liquor supplemented with 15% 2R-3R-butanediol. After mounting the crystals 
on a loop, they were flash cooled and data were collected at 1.00 A wavelength at 
SER CAT ID-22 beamlines (APS) and processed using HKL-2000”’. Structures were 
solved through molecular replacement with Phaser*™“', using a previously obtained 
free structure of VRCO1 germ line as a search model. Structure solution identified 
one Fab per asymmetric unit in a P4,2,2 lattice. Refinement of the structure was 
undertaken with Phenix, with iterative model building using Coot*’. Refinement 
resulted in an Reryst value of 16.65% (Rfree = 18.22%). The structure was validated 
with MolProbity®, yielding 98.1% and 100% of residues falling within most favoured 
Ramachandran regions and allowed Ramachandran regions, respectively. All graph- 
ics were prepared with Pymol (PyMOL Molecular Graphics System). 

Electron microscopy and image processing. Negative stain EM grids were pre- 
pared as previously stated**. Data were collected using a FEI Tecnai T12 electron 
microscope operating at 120 keV, with an electron dose of ~30e” A~? and a mag- 
nification of 52,000X that resulted in a pixel size of 2.05 A at the specimen plane. 
Images were acquired with a Tietz TemCam-F416 CMOS camera using a nominal 
defocus range of 800 to 1,000 nm using the Leginon interface. Image processing 
was carried out as described previously”. The final reconstruction was performed 
using 4,746 unbinned particles, refining for 40 iterations with C3 symmetry applied. 
The resulting density was ~19 A resolution at a Fourier shell correlation (FSC) 
cutoff of 0.5. Fab fitting was carried out using the Fit function in Chimera, using 
0.0115 contour level for the map. The Fab orientation where the heavy chain is 
towards gp120 (as shown in Fig. 3b) was chosen on the basis of the two correlation 
coefficients: 0.95 (471 of 7,058 atoms outside contour) versus 0.91 (1,012 of 7,058 
atoms outside contour). In the alternate orientation (light chain towards gp120), 
the FR3 insertion did not fit within the density. The man-9 glycans were modelled 
onto the BG505 SOSIP.R6.664 (PDB ID 4NCO)* using the GLYCAM-Web server 
(http://www.glycam.com). Dominant sites of vulnerability to neutralizing antibody 
elicited during chronic infection in Extended Data Fig. 7 is shown in the context of 
an EM tomogram from the HIVgar viral spike’. 
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Extended Data Figure 1 | Analysis of 35022 autoreactivity. a, Reactivity of as positive controls and 13H1 as a negative control. c, Reactivity of 35022 
35022 with HEP-2 epithelial cells. 2F5 was used as a positive controland17bas —_ with autoantigens detected in Luminex assay. 4E10 was used as a positive 


a negative control. Antibody concentration was 25 1g ml’. All pictures are control. Synagis, an anti-RSV monoclonal antibody, was used as a negative 
shown at 400 magnification. b, SPR analysis of 35022 binding to anionic control. SSA, Sjogren’s syndrome antigen A; SSB, Sjogren’s syndrome antigen 
phospholipids. 35022 was injected over PC-CLP liposomes or PC-PS B; Sm, Smith antigen; RNP, ribonucleoprotein; Scl 70, scleroderma 70; 


liposomes immobilized on the BIAcore L1 sensor chip. 4E10 and 2F5 were used —_ Jol, antigen; CentrB, centromere B. A positive response is >120 units. 
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Extended Data Figure 2 | Neutralization similarities between 35022 and red, CD4bs; blue, glycan-V3; green, V1V2; light blue, MPER; purple, other. 
other HIV-1 bNAbs. a, Correlation (Spearman) between the neutralization 35022 (yellow) clusters separately from all other antibodies, indicating a novel 


potencies of 35022 and the indicated antibody against 172 pseudoviruses. mechanism of neutralization. ¢c, 35022 competition with other bNAbs on 
Representatives from all four major sites of vulnerability are shown. Resistant | HIVjgy, VLPs with the trimer stabilizing SOS mutations in an ELISA assay. 
strains corresponding to values of >50 1g ml! are plotted as 50. Biotin-bNAbs were titrated into the ELISA at increasing concentrations in the 
b, Neutralization-based clustering of bNabs over a set of 172 diverse HIV-1 presence of excess (10 ug ml ') cold competitor neutralizing antibodies. 


strains. A putative epitope-specific clustering cutoff is shown as a dashed line. _ Values in the table indicate percentage binding of biotin-nAbs in the presence 
Antibodies are coloured according to the respective target site of vulnerability: of cold competitor. 
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Extended Data Figure 5 | Binding specificity of 35022. a, ELISA binding of — gp41 monomers. b, c, ELISA binding of gp120 (b) and gp140 (c) monomers 
indicated monoclonal antibodies to HIVyy2 gp140 foldon trimer, gp120 and _— from different HIV-1 subtypes. 
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Extended Data Figure 6 | 35022 Fab features. a, 35022 is seenlookingdown density 2F, — F, contoured at 1c. b, Superposition of HIVgar gp160 negative 
on the combining site from the viewpoint of antigen in ribbons with the CDR _ stain (yellow surface) with the negative stain reconstruction of soluble BG505 
coloured as in Fig. 3a. Insets (bottom row) show structural details of the SOSIP in complex with 35022 (grey surface) gives an estimation of the viral 
framework 3 insertion and disulphides in CDR L1 and CDR L3 with electron membrane location relative to 35022 antibody as shown in Fig. 3b. 
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Extended Data Figure 7 | Binding site of 35022 on the HIV Env trimer. Binding site of 35022 (red) relative to those of PGT151 (blue) or 8ANC195 (green) are 
shown. 
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Extended Data Figure 8 | A new site of HIV-1 vulnerability at the interface 
of gp120 and gp41 and prevalence of targeting. a, Dominant sites of 
vulnerability to neutralizing antibody elicited by natural infection, shown in the 
context of an EM tomogram from the BaL viral spike. The viral membrane is 
positioned at the top of the spike. It is unclear if 35022 and MPER antibodies 
bind to this form of the viral spike, and approximate locations for these are 
shown in dotted outlines. b, Viral spike from the soluble BG505 SOSIP 
context, shown in the same orientation as a, with gp120 surface coloured by 
conservation from 0% to 100%, from 4,265 HIV-1 strains (white to purple for 
protomer 1 with scale shown, white to blue for protomer 2, and white to orange 
for protomer 3), with glycans shown in green when present in more than 90% of 
strains, in grey when present in 30-90% of strains and not shown otherwise. 
c, 35022-identified site of HIV-1 vulnerability comprises both conserved 
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amino acids and a cluster of glycans, including N88 from gp120 and N625 from 
gp41. N230 and N24] are not present in BG505 strain. The 35022 epitope 

is shown by a yellow dotted line. d, Neutralization fingerprints for 35022 and 
for antibodies encompassing ten different epitope specificities representing the 
other four known major sites of Env vulnerability were used to interrogate 
the serum specificities of 34 HIV-infected patients. Values (with proportional 
colour intensities) predict the fraction of serum neutralization that can be 
attributed to each antibody specificity. Possible 35022-like signals were 
predicted for 13 of the sera (values >0.2), while strong signals were observed in 
3 of the sera (values >0.3). A panel of 21 HIV-1 strains was used in the 
neutralization analysis and for computing serum breadth. e, Sites of HIV-1 
vulnerability to neutralizing antibody outlined by a yellow line. Prevalence 

in a 34-donor cohort and critical glycans are indicated. 
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Extended Data Figure 9 | Autologous virus Env sequences and the impact —_ Differences between autologous and JRCSF sequences are labelled in green. 
of variants on 35022 neutralization. a, A total of 12 single-genomeamplicons _b, 35022 neutralization of JRCSF pseudovirus or variants containing the 
from plasma of patient N152 were sequenced. Donor Env sequences autologous virus mutations from patient N152. Error bars denote one standard 
together with the reference sequences of JRCSF and LAI are aligned. Amino error of the mean. 

acids critical for 35022 neutralization of JRCSF and LAI are labelled in yellow. 
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Extended Data Table 1 | Neutralizing activity of sera or monoclonal antibodies against HIVjycsp pseudovirus with mutation in the 35022 
epitope 


ID50 Fold change* 

Sera ID WI N88A_ N230A N244A N625A N88A_ N230A N241A NG625A 
20 6864. 7396 5295 5261. 5781 09 13 13 1.2 
44 6095 2824 1177 2632 776 22 a: Ea 
45 4019 2422 2094 1678 3265 17 1.9 24 1.2 
56 655 «1124.—S «2017 89 1901 0.6 0.3 03 
57 7586 626 4981 393 355 15 

127/C 5217 820 1829 1082 789 2.9 
N6 5966214 437 436 679 
N17 62354 «1121 743 2279 ~~ 1653 
N22 9735 932 3017 1591 3523 
N32 3131 539 1238 2538 686 
N44 5439 «1532, «2110 = 2622S 2003 3.6 2.6 24 27 
N53 6690 963 1048 352 1044 
N72 6722 1484 1058 199 2482 
N9O 4275 3853 1479 1018 10576 1.1 2.9 42 0.4 
N112 5903 2344«=Ss« 67 706 «6071 
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1 3485 1605 43-343 197 256 
30 14255 5620 3314 13125 6221 
N26 19628 1853 2443 1951 3917 
N18 60183 580 1202 480 656 
N116 586 2504 2176 1948 616 
N27 1088 1446 4656 ~—-838 790 
N62 907 536 462 429 263 
N126 2502 1132 «= 1146~=S 3554. 1807 
N171 12329 5967 3125 1948 1904 

2208 21501 9337 4338 8740 1815 

2256 114671584. =Ss«1881.=Ss«2055.—Ss«1240 

2258 3164731542018 ~— 2094S 4114 
mAb IC50 Fold changet 
10E8 0.078 0.0098 0.01984 0.04513 0.01757. 0.13 03 0.58 0.23 

PGT121 0.01508 0.02161 0.02053 0.02399 0.02431 1.43 1.4 159 1.61 
PG9 —-:0.00692 0.00216 0.0022 0.00189 0.00238 0.31 0.3 0.27 ~—0.34 

35022 0.0106 >20 >20 >20 >20 


* Fold change indicates IDso of HIVircse WT/IDs0 of HIVjacsr mutant. Values with fold changes >5 are highlighted in yellow. 
+ Fold change indicates ICso of HIVjacsp mutant/ICso of HIVjacsr WT. Values with fold changes >5 are highlighted in yellow. 
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Pseudouridine profiling reveals regulated mRNA 
pseudouridylation in yeast and human cells 


Thomas M. Carlile’, Maria F. Rojas-Duran', Boris Zinshteyn', Hakyung Shin', Kristen M. Bartoli’ & Wendy V. Gilbert’ 


Post-transcriptional modification of RNA nucleosides occurs in all 
living organisms. Pseudouridine, the most abundant modified nuc- 
leoside in non-coding RNAs’, enhances the function of transfer RNA 
and ribosomal RNA by stabilizing the RNA structure” *. Messenger 
RNAs were not known to contain pseudouridine, but artificial pseu- 
douridylation dramatically affects mRNA function—it changes the 
genetic code by facilitating non-canonical base pairing in the ribo- 
some decoding centre*’°. However, without evidence of naturally occur- 
ring mRNA pseudouridylation, its physiological relevance was unclear. 
Here we present a comprehensive analysis of pseudouridylation in 
Saccharomyces cerevisiae and human RNAs using Pseudo-seq, a genome- 
wide, single-nucleotide-resolution method for pseudouridine iden- 
tification. Pseudo-seq accurately identifies known modification sites 
as well as many novel sites in non-coding RNAs, and reveals hundreds 
of pseudouridylated sites in mRNAs. Genetic analysis allowed us to 
assign most of the new modification sites to one of seven conserved 
pseudouridine synthases, Pus1-4, 6, 7 and 9. Notably, the majority 
of pseudouridines in mRNA are regulated in response to environ- 
mental signals, such as nutrient deprivation in yeast and serum star- 
vation in human cells. These results suggest a mechanism for the rapid 
and regulated rewiring of the genetic code through inducible mRNA 
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modifications. Our findings reveal unanticipated roles for pseudouridy- 
lation and provide a resource for identifying the targets of pseudouridine 
synthases implicated in human disease’. 

Although more than 100 classes of RNA modifications have been charac- 
terized, primarily in tRNA and rRNA“, only three modified nucleotides have 
been identified within the coding sequences of mRNA, N°-methyladenosine 
(m°A), 5-methylcytosine (m°C) and inosine'*’. To define the global 
landscape of RNA pseudouridylation in vivo and determine whether 
mRNAs contain pseudouridine (‘¥), we developed a high-throughput 
method to identify ‘Y in the transcriptome with single-nucleotide resolution. 
can be selectively modified with N-cyclohexyl-N’ -(2-morpholinoethyl)- 
carbodiimide metho-p-toluenesulphonate (CMC) to generate a block 
to reverse transcriptase one nucleotide 3’ to the pseudouridylated site”. 
We exploited this chemistry to determine the locations of Y using next- 
generation sequencing (Fig. 1a; see Methods). Mock-treated (— CMC) 
RNA fragments were processed in parallel to identify pseudouridine- 
independent reverse transcription stops. 

Using Pseudo-seq and stringent ‘V-calling criteria, we identified 42/51 
known Vs in rRNA and small nuclear RNA (Supplementary Table 1) 
with an observed false positive rate of 0.1%. The estimated false discovery 
rate (FDR) ranges from approximately 5% for highly expressed genes 
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Figure 1 | Genome-wide pseudouridine sequencing with single-nucleotide 
resolution. a, Schematic of Pseudo-seq library preparation. nt, nucleotides. 
b, Genome browser view of Pseudo-seq reads mapping to a 200-nt region of 
RDN25-1 (chrXII: 452168-452367) containing six known Vs, generated from 
pooled reads for n = 12 technical replicates from wild-type log-phase yeast 
cultures. Peaks of ’-dependent reads are indicated with dashed red lines. Reads 
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per million (RPM). c, A metaPsi plot of mean normalized reads (left axis) for 
+CMC (orange) and —CMC (blue) libraries. The number of Vs at each 
position in the metaPsi window is indicated (black, right axis). CMC- 
dependent reverse transcription stops are found 1 nt 3’ of known Ws. 

d, A receiver operating characteristic curve of the Pseudo-seq signal for all 
known Ws in rRNA and U2 snRNA. 
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to approximately 12.5% for lowly expressed genes. (Fig. 1b-d and 
Methods). Six out of nine false negatives were due to ‘shadowing’ from 
reverse transcription stops 3’ of the P (for example, 25S-‘?2258 was 
not detected upstream of ‘¥2260). We also identified many Ys intRNA, 
all of which occurred at known positions (Supplementary Table 2). We 
verified the single-nucleotide resolution of Pseudo-seq by profiling four 
small nucleolar RNA deletion mutants that eliminate pseudouridyla- 
tion of nine specific rRNA and snRNA target sites (Extended Data Fig. 
la-d). Similar specificity and sensitivity were achieved using different 
reverse transcription enzymes, RNA fragment lengths, CMC concen- 
trations, and truncated cDNA lengths, demonstrating the robustness 
of the Pseudo-seq method (Extended Data Fig. 2a-d). 

After validating the ability of Pseudo-seq to detect known Ws in non- 
coding RNAs, we next analysed mRNA pseudouridylation in budding 
yeast during post-diauxic growth (Agoo nm = 12) (Extended Data Fig. 3a). 
To define high confidence Pseudo-seq hits even in transcripts with sparse 
read coverage, we required reproducibility in 10/14 independent experi- 
ments (Extended Data Fig. 3b, c). Notably, we found that many mRNAs 
contain P (Fig. 2a). In total, we conservatively identified 260 ‘Y's in 238 
protein-coding transcripts (Supplementary Table 3). Relaxing our criteria 
to include Vs detected in 9/14 experiments, a category that includes the 
known ‘V56 in U2 snRNA, increased the number of candidate mRNA 
Ys to 466. We established a rough detectability threshold by determin- 
ing the lowest observed expression level of genes having sufficient reads 
for reproducible ' calling; 5,278 genes passed the cutoff. Thus, it is unlikely 
that there are substantially more mRNA Ws to be discovered in post- 
diauxic yeast. We conclude that mRNA pseudouridines are relatively 
scarce. ‘Vs were found in 5’ transcript leaders (5'TLs), coding sequences 
(CDS), and 3’ untranslated regions (3’ UTRs) with an underrepresen- 
tation of V in3’ UTRs (P= 10 4, hypergeometric test) (Fig. 2b). GUA 
valine codons were the most frequently modified, suggesting the exist- 
ence of a sequence-specific mechanism for mRNA pseudouridylation 
(Extended Data Fig. 4). 

We investigated the potential for regulation of mRNA pseudouridy- 
lation by comparing two cellular conditions with substantial differences 
in gene expression and physiology: log phase and post-diauxic growth. 
Remarkably, most mRNA Vs were regulated: 42% of the sites modified 
during post-diauxic growth were not detectably modified in log phase, 


whereas other sites, such as CDC33 ‘286, were much more extensively 
modified during exponential growth. Moreover, of the 150 modified 
sites detected in both log phase and post-diauxic growth, 62 showed 
>twofold changes in peak height between conditions indicating growth- 
state-dependent changes in the extent of mRNA modification (Fig. 2a 
and Supplementary Table 3). Importantly, we ruled out differences in 
mRNA expression as an explanation for condition-dependent differ- 
ences in ’ detection (Extended Data Fig. 5). Thus, the process of mRNA 
pseudouridylation is regulated in response to environmental cues. 

Yeast non-coding RNAs (ncRNA) have been extensively character- 
ized for post-transcriptional modifications. Nevertheless, we identified 
74 novel pseudouridylated sites in ncRNAs (Supplementary Table 4). 
A few, like ‘274 in the RNase MRP RNA (NME1) (Fig. 2c), were con- 
stitutively modified, while most, including the previously described ‘¥56 
and ‘P93 in U2 snRNA”, were induced during post-diauxic growth (Ex- 
tended Data Fig. 6a). snoRNAs were notably enriched among ncRNA 
classes with regulated pseudouridines: 19/29 H/ACA and 14/47 C/D 
snoRNAs showed one or more sites specifically modified in cells grown 
to high density (Fig. 2d, Extended Data Fig. 6b, c). Pseudouridylation of 
rRNA sites changed very little: only one site, 25S-‘¥?2314, changed more 
than twofold. However, owing to the stability of rRNA and the greatly 
reduced rate of ribosome synthesis during post-diauxic growth, we can- 
not rule out production of a minority population of differentially modi- 
fied ribosomes in dense cultures. 

We next sought to define the molecular basis for targeting of novel 
mRNA and ncRNA sites for pseudouridylation. ‘¥s in rRNA, snRNA and 
tRNA are produced by two classes of enzymes with distinct modes of 
target recognition. The first class, which includes yeast Cbf5 and human 
dyskerin, associates with H/ACA snoRNAs to direct pseudouridylation 
of sites that base pair with the snoRNA guide sequences, while the second 
class recognizes its targets without the aid of an RNA guide. We com- 
putationally identified 157 unique sites in mRNAs containing perfect 
matches to canonical snoRNA targets (Supplementary Table 5). When 
these potential pseudouridylation sites were considered in aggregate, stat- 
istically significant pseudouridylation was detected (Fig. 3a, Extended 
Data Fig. 7a, b), which increased with the number of base pairs to the 
snoRNA guide sequence and was specific to post-diauxic growth (Ex- 
tended Data Fig. 7c, d). However, only three such sites passed our threshold 
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Figure 2 | Yeast mRNAs and ncRNAs are inducibly pseudouridylated. 

a, c, CMC-dependent peaks of reads are indicated with a dashed red line. The 
median Pseudo-seq peak heights in each condition are given ~ s.d.; negative 
peak values occur when the reads in the —CMC library exceed those in the 
+CMC library. Traces are representative of four wild-type biological replicates. 
a, Pseudo-seq reads in RPS28B (chrXII: 673163-673336), MRPS12 (chrXIV: 
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100 bases 
694489-694736) and CDC33 (chrXV: 50560-60875). b, Summary of 
locations of ‘Ys within mRNA features. CDS, coding sequence. c, Pseudo-seq 
reads in U5 snRNA (snR7-L, chrVII: 939458-939671), RNase MRP RNA 


(NME1, chrXIV:585585-585925) and an H/ACA snoRNA (snR37, chrX: 
228090-228475). d, Summary of novel Ps identified in ncRNA. 
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Figure 3 | Mechanisms of mRNA pseudouridylation. a, MetaPsi plot of 
mean normalized reads for computationally predicted snoRNA-dependent 
targets in mRNA from high density cultures (orange), log phase cultures (blue), 
+CMC (solid) and —CMC (dashed). Indicated are the predicted snoRNA 
target site (black, dashed), and the expected peak of CMC-dependent reads 
(black, dotted). Reads were pooled from four wild-type biological replicate 
libraries. b, Summary of Vs identified by Pseudo-seq as PUS-dependent. The 
few PUS6- and PUS9-dependent Vs are not shown. The locations of Vs within 
mRNAs and the distribution of ‘’s among ncRNA types are indicated. NC, 
noncoding sequence. cf, Sequence motifs surrounding PUS1- (c), PUS2- (d), 
PUS7- (e) and PUS4-dependent (f) ‘s in mRNAs, generated by WebLogo 
3.3. g, PUS7-dependent Pseudo-seq reads in MRPL36 (chrIlI: 484301-484400). 


for Y calling on their own (Extended Data Fig. 7e). Thus, it is probable 
that many additional mRNAs are pseudouridylated at a low level and 
our estimate of 260 mRNA s represents a conservative minimum. 
Because most pseudouridylated sites showed no significant complemen- 
tarity to snoRNA guide sequences, we next investigated whether snoRNA- 
independent pseudouridine synthases are responsible for modifying 
sites in mRNAs. Yeast has nine pseudouridine synthase (PUS) genes, 
all of which are expressed in both log phase and post-diauxic growth. 
We profiled the eight viable PUS deletion strains (pus4) grown to high 
density and identified mRNA targets for each Pus protein, with the 
exception of Pus5 whose only known target is the 21S mitochondrial 
rRNA” (Fig. 3b, Extended Data Fig. 8a, b and Supplementary Table 6). 
The largest number of mRNA and novel ncRNA Ws could be assigned 
to Pus1, a member of the TruA family that constitutively modifies mul- 
tiple positions in cytoplasmic tRNAs and one position in U2 snRNA by 
a mode of target recognition that is incompletely defined. Whereas 
known Pus1-dependent tRNA targets showed constitutive pseudour- 
idylation as expected, most of the mRNA targets showed increased 


LETTER 


modification during post-diauxic growth (Extended Data Fig. 8c, Sup- 
plementary Table 3). The mRNA targets of Pus] showed little similarity 
at the primary sequence level, consistent with the proposed structure- 
dependent mode of target recognition by this enzyme (Fig. 3c, Extended 
Data Fig. 8d)’, while Pus2, a close paralogue of Pusl, had 14 mRNA 
targets with a weak sequence consensus distinct from Pus1 (Fig. 3d, 
Extended Data Fig. 8e). Intriguingly, the Pus] targets included seven 
genes encoding five proteins of the large ribosomal subunit, a signifi- 
cant enrichment (P = 0.025). Our comprehensive pseudouridine pro- 
filing more than doubles the number of known substrates of Pus] and 
Pus2, identifies unanticipated mRNA targets, and provides the first 
demonstration of regulated pseudouridylation by these enzymes. 
Unlike Pus1 and Pus2, the mRNA targets of Pus4 and Pus7 contained 
clear consensus sites in agreement with the known sequence requirements 
for these enzymes to modify their canonical tRNA targets, UGYAR 
for Pus7 and GU'‘YCNANNC for Pus4 (Fig. 3e-g, Extended Data 
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Figure 4 | Regulated pseudouridylation of human RNAs. Pseudo-seq was 
performed on HeLa cells grown in the presence or absence of serum for 24h. 
a, b, CMC-dependent peaks of reads are indicated with a dashed red line. The 
median Pseudo-seq peak heights in each condition are given + s.d. Traces are 
representative of n = 4 (—serum), and n = 5 (+serum) biological replicates. 
Genome browser views represent spliced transcripts. a, Pseudo-seq reads from 
RPL19 (12-460), and ATP5E (154-437). b, Summary of locations of Vs within 
mRNA features. c, Pseudo-seq reads from MALATI (5081-5636) and RN7SK 
(142-307). d, Summary of novel Vs identified in ncRNA. 
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Fig. 8f-h)***°. We also identified novel targets for Pus3 (20 mRNA, 
1 ncRNA), Pusé6 (3, 1) and Pus9 (1, 0), and, in total, assigned 52% of 
mRNA Ws and 31% of novel ncRNA Ws to individual Pus proteins. 
The remaining sites may be modified by the essential protein Pus8 
and/or may be redundantly targeted by multiple Pus proteins. Together, 
these results reveal unanticipated diversity in Pus targets and show that 
Pus-dependent non-tRNA sites are regulated in response to changing 
cellular growth conditions. The discovery of novel mRNA substrates 
for Pus proteins raises the possibility that other tRNA modifying enzymes 
may likewise target mRNAs. 

As the pseudouridine synthases that modify yeast mRNAs are con- 
served throughout eukaryotes, we investigated whether regulated mRNA 
pseudouridylation also occurs in mammalian cells. Human cervical car- 
cinoma (HeLa) cells were profiled during normal proliferation and 24h 
after serum starvation. Pseudo-seq detected known pseudouridines with 
good sensitivity and specificity (Supplementary Table 7, Extended Data 
Fig. 9a—c). By restricting our analysis to more highly expressed genes 
and requiring reproducibility in four independent biological replicates, 
we conservatively identified 96 ‘Vs in 89 human mRNAs (Supplemen- 
tary Table 8). As in yeast, some ‘Y modifications in human mRNAs were 
regulated by cellular growth state (Fig. 4a, Extended Data Fig. 10a, b), 
and modified sites were found throughout the transcript (Fig. 4b). We 
also discovered novel ‘Ys in human ncRNAs, including 4 previously 
unknown sites in rRNA (Extended Data Fig. 10c, Supplementary Table 9) 
and sites in long non-coding (Inc-), sn- and snoRNAs (Fig. 4c, d). Thus, 
the Pseudo-seq approach is broadly applicable to diverse organisms 
and growth states. Moreover, the phenomenon of regulated mRNA pseu- 
douridylation is conserved from yeast to humans. 

In summary, Pseudo-seq provides comprehensive analysis of RNA 
pseudouridylation with single-nucleotide resolution and reveals that 
endogenous mRNAsare specifically pseudouridylated in a highly regu- 
lated manner in yeast and human cells. Because stabilizes RNA structure, 
mRNA pseudouridylation could alter translation initiation efficiency”, 
ribosome pausing”’, RNA localization” and regulation by RNA inter- 
ference”, to name a few aspects of mRNA metabolism known to be 
affected by RNA structure, although we cannot exclude the possibility 
that many instances of mRNA pseudouridylation may be functionally 
silent. However, given recent evidence that pseudouridine profoundly 
affects decoding by ribosomes from diverse organisms’, our results also 
raise the possibility of widespread regulated rewiring of the genetic code. 
Finally, this work suggests that diseases associated with mutations in 
pseudouridine synthases, including mitochondrial myopathy and side- 
roblastic anaemia (MLASA)", dyskeratosis congenita’” and lung can- 
cer’’, could be due to misregulation of mRNA targets. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Yeast strains and growth. All yeast strains are Saccharomyces cerevisiae BY4741 
or BY4742 derivatives (BY 4742: wild type (YWG11), snr374 (YWG287, YWG343), 
snr43A (YWG293), snr494 (YWG299, YWG354), snr814 (YWG322, YWG376) 
pus4A (YWG1251, YWG1252), BY4741: pus1A (YWG1209), pus1A (YWG1209), 
pus2A (YWG1210), pus34 (YWG1211), pus5A (YWG1212), pus6A (YWG1213), 
pus7A (YWG1214), pus94 (YWG1215)). The snoRNA deletion strains and pus44 
strains were made using PCR-based deletion cassettes*’. The other pusA strains 
were obtained from the Yeast Deletion Collection”. Strains were grown at 30 °C in 
YPAD (1% yeast extract, 2% peptone, 0.01% adenine hemisulphate, 2% glucose), 
and were harvested by centrifugation in log phase (A¢oo nm ~ 1), or at high density 
(A¢oonm * 12-15). 

Cell culture. HeLa (human cervix adenocarcinoma; CCL-2, ATCC) cells were 
cultured in DMEM (D6429; Sigma) supplemented with 10% fetal bovine serum 
(FBS; Atlanta Biologicals). Cells were grown at 37 °C with 5% CO, under standard 
laboratory conditions. For serum starvation cells were plated at a density of 5 X 10° 
per 150-mm plate in DMEM+ 10% FBS, 24h before the experiment. Cells were 
then washed three times in PBS, before the addition of either serum-free medium 
(DMEM, no FBS) or full medium containing FBS (DMEM+ 10% FBS) for 24h. 
Pseudo-seq library preparation. Yeast total RNA was isolated by hot acid phenol 
extraction, followed by isopropanol precipitation®*. HeLa total RNA was isolated 
using QIAzol (Qiagen; 79306). PolyA+ RNA was isolated from 10 mg (yeast) or 
2 mg (HeLa) total RNA using oligo dT cellulose beads (NEB; $1408S), as described**. 
For some libraries, two sequential rounds of polyA selection were performed. Yeast 
RNA was fragmented in 10mM ZnCl, at 94°C for 5 min (total RNA) or 55s 
(polyA+ RNA), and HeLa RNA was fragmented in 10mM ZnAcetate at 60 °C 
for 10 min. Fragmented RNA was then precipitated. 

CMC treatment of RNA fragments was as follows”. RNA was denatured in 
5mM EDTA at 80 °C for 2 min, and then placed on ice. 0.5 M CMC in BEU buffer 
(7 Murea, 4 mM EDTA, 50 mM bicine, pH 8.5) was added to a final concentration 
of 0.2 or 0.4M CMC (4X RNA volume). CMC modification was carried out at 
40 °C for 30 min, followed by ethanol precipitation. Subsequent reversal of modi- 
fication of Us and Gs was carried out in NaCO3 buffer (50 mM sodium-carbonate, 
pH 10.4,2 mM EDTA) at 50 °C for 2 h, followed by precipitation. In parallel mock- 
treated samples were incubated in BEU buffer without CMC. 

RNA fragments were dephosphorylated with CIP (NEB; M0290) and PNK (NEB; 
M0201), followed by size selection and elution of 80-100, 100-120, and 120-140-nt 
fragments on an 8% urea-TBE polyacrylamide gel electrophoresis (PAGE) gel, fol- 
lowed by precipitation. RNA fragments were eluted from gel slices overnight at 4 °C 
with gentle rocking in 400 jl RNA elution buffer (300 mM NaOAc pH 5.5, 1mM 
EDTA, 100 Uml~? RNasin (Promega; N2615)). Ligation of a pre-adenylated 3’ 
adaptor (IDT; /5Phos/TGGAATTCTCGGGTGCCAAGG/3ddC/) was carried out 
with T4 RNA ligase (NEB; M0204) in 1X buffer without ATP (50 mM Tris-HCl, 
pH7.8, 10 mM MgCl, 10 mM DTT) at 22 °C for 2.5 h, followed by precipitation. 

Reverse transcription (RT) was carried out using AMV-RT (Promega; M5108) 
with the following conditions. The reverse transcription primer (IDT; /5Phos/GAT 
CGTCGGACTGTAGAACTCTGAACCTGTCGGTGGTCGCCGTATCATT/iSp18/ 
CACTCA/iSp18/GCCTTGGCACCCGAGAATTCCA) and RNA were denatured 
and annealed in reverse transcription buffer (50 mM Tris-Cl pH 8.6, 60 mM NaCl, 
10mM DTT). After annealing, dNTPs (3.3 mM each final) and MgCl, (6 mM final) 
were added, and reverse transcription was carried out at 42 °C for 1h. Truncated 
cDNAs were size-selected and purified on an 8% urea-TBE PAGE gel, followed by 
precipitation. cDNAs were eluted from gel slices overnight at room temperature with 
gentle rocking in 400 pil DNA elution buffer (300 mM NaCl, 10 mM Tris, pH 8.0). 

cDNAs were circularized with circLigase (Epicentre; CL4115K), and amplified 

by PCR with Phusion (NEB; M0530) with the forward primer (IDT; AATGATAC 
GGCGACCACCGA), and a barcoded reverse primer (IDT; CAAGCAGAAGAC 
GGCATACGAGATXXXXXXGTGACTGGAGTTCCTTGGCACCCGAGAATT 
CCA). PCR products were gel-purified, precipitated and sequenced on an Illumina 
HiSeq 2000. 
Sequencing data analysis. RNA-seq data was analysed with in-house Bash and 
Python scripts unless otherwise specified. For yeast libraries, adaptor sequences 
were trimmed using Cutadapt*’, and were subsequently mapped to the S. cerevisiae 
genome downloaded from the Saccharomyces Genome Database (SGD) on 9/2/ 
2011. Mapping to the genome and defined splice junctions (UCSC, sacCer3) was 
performed using Tophat2**. Multiply mapping reads were allowed. Using SAMtools 
to exclude multiply mapping reads affected ‘Ys called in repetitive or paralogous 
features, but not ‘Ys identified in other features*’. 

Trimmed reads from HeLa libraries were mapped with Bowtiel allowing up to 2 
mismatches to a database of spliced transcripts (hg19 sequence, and transcripts 
downloaded from UCSC on 1 August 2012) containing the transcript with the longest 
coding sequence, or the longest transcript for non-coding genes”*. Multiply mapping 
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reads were allowed for generation of receiver operating characteristic curves and 
MetaPsi plots, but were excluded for ‘V predictions. 

Identification of . The yeast transcriptome (downloaded from SGD on 9 February 
2011) with annotated 5’ and 3’ UTRs was used to identify new sites of pseudouridyla- 
tion”. Where annotated 5’ and 3’ UTRs were not available, median UTR lengths 
were used. To identify new sites of pseudouridylation in HeLa cells the human 
transcriptome described above was used. For a given +/—CMC library pair, the 
—CMC libraries were first scaled to the size of the +CMC libraries. Peak values 
were calculated for each position 1 nt 3’ of a U (peak position) in all features with 
an average per nucleotide read coverage greater than a specified read cutoff: 


+ wets 
Bee I eee 
Where r* andr indicate the number of reads whose 5’ ends map to the position 
being examined in the +CMC and -CMC libraries, respectively, wr* and wr" rep- 
resent the numbers of reads whose 5’ ends map to a window centred at the position 
being examined (exclusive of reads at that position), and ws specifies the size of this 
window (exclusive of that position). Sites with peak positions greater than a spe- 
cified peak cutoff were flagged as potential . To filter out false positives reproducib- 
ility of peak calling in a certain number of libraries was required. 

Window size (ws) was set to 150 for all analyses. Only features surpassing an 

average reads/nt threshold (read cutoff) were considered. For high density yeast, 
the read cutoff was set to 0.0, the peak cutoff was set to 1.0, and reproducibility was 
required in 10 of 14 libraries. For log phase yeast, the peak values from log phase 
data were calculated for the high density identified ‘¥. For both serum fed and 
serum starved HeLa cells, the read cutoff was set to 1.0, the peak cutoff was set to 
2.0, and reproducibility was required in 4 of 4 libraries. A subset of called ‘¥ in HeLa 
cells came from very narrow (<20 nt) regions of uniquely mapping reads. These 
calls were considered unreliable and were removed. 
MetaPsi plots and ROC curves. For a given , the reads at each position ina 51-nt 
window centred at the ‘¥ were normalized to the average reads per nucleotide within 
the window. These windows of normalized reads were then averaged for all known 
f in yeast rRNA and U2 or human rRNA and snRNA, yielding a metaPsi. Given 
the close spacing of ‘¥ in these features, the number of ¥ at each position in the 
metaPsi window was also plotted. 

To generate receiver operating characteristic (ROC) curves for a given +/—CMC 
library pair, the —CMC libraries were first scaled to the size of the + CMC libraries, 
and peak values were calculated (see above) for each position 1 nt 3’ ofa U or ¥ in 
the features above. Additionally, a -CMC peak value was determined: 
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Parameters are as defined earlier. A range of 10,000 equally spaced cutoff scores 
were chosen spanning the range of observed peak values. At each cutoff score, the 
true positive and false positive rates were calculated, and plotted. 

Estimation of false discovery rate. The lower bound for FDR was estimated from 
the observed FDR for the rRNA. The upper bound of FDR for lowly expressed genes 
was estimated by randomly down-sampling reads in the rRNA and U2 snRNA to 
a level of coverage comparable to that of lowly expressed mRNAs with ‘Ys. These 
randomizations were performed 14 times followed by '¥ calling on down-sampled 
libraries as described above. This number should be considered a rough estimate 
because the ribosomal RNA may not provide a perfect basis for estimating the FDR 
of calling in mRNA. The observed number of false positives in the rRNA and 
snRNA under the criteria used to call ‘¥s in mRNA was two incorrect ¥ calls in 
1905 U residues (0.1%). 

Defining regulation and factor-dependence. To determine if a given ‘¥ was 
condition dependent the median peak values between two conditions were compared. 
For yeast only wild-type peak values were included. A twofold or greater change 
between conditions was considered regulated. 

‘Vs were identified as Pus-dependent if the peak heights in both biological repli- 
cates ofa given pusJ strain were less than 25% of the median peak height for that 'V 
across all libraries. For a given ‘¥ to be considered PUS-dependent, we required at 
least one replicate for a given pus/ strain to have sufficient reads in the 150-nt 
window surrounding the P to be greater than 25% of the median reads in that 
window for all libraries. 
snoRNA target site predictions and analysis. To identify potential sites of pseu- 
douridylation within yeast mRNAs, the yeast transcriptome (described above) was 
scanned for sites that perfectly match the known target sequences of all yeast Box 
H/ACA snoRNAs, allowing mismatches at bases that are unpaired in known target 
sites”. 

For the analysis presented in Extended Data Fig. 7a, b, 10,000 randomizations 
were performed. For each trial, a random U was chosen for each non-repetitive com- 
putationally predicted snoRNA target, and was matched to the same gene, and 
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transcript feature (5’ UTR, CDS, 3’ UTR). Each randomized set of Us was used to 
generate a metaPsi from the pooled reads of four libraries, and the differences in 
mean normalized reads between the +CMC and —CMC poolsat the peak position 
were calculated. The distributions of these values were plotted in a histogram, and 
compared to the values for the computationally predicted snoRNA target sites. 
Plotting and other analyses. Sequences and structures of tRNAs were obtained 
from tRNAdb, and P locations were previously published*~**. Motifs were gener- 
ated using WebLogo 3.3 using default settings, and the modified position was chan- 
ged to a P after logo generation. UCSC genome browser was used to generate 
plots of rpm data, and matplotlib was used to generate the remainder of graphs“. 
RPKMs (reads per kilobase of exon sequence per million exon reads) were calcu- 
lated from —CMC libraries. GO analysis was performed using the YeastMine feature 
of SGD (http://yeastmine.yeastgenome.org/). The set of genes whose coverage was 
sufficient to reproducibly call ‘¥s was used as a background set. These 5,278 genes 
had average RPKMs in post-diauxic wild type cultures of = 9.25, the level of express- 
ion of the lowest expressed pseudouridylated mRNA called by our algorithm. 
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Extended Data Figure 1 | Detection of specific snoRNA target sites by (chrXII: 454111-454160, left), and U2 snRNA (LSR1, chrII: 681791-681840, 


Pseudo-seq. Pseudo-seq was performed on wild-type (n = 4), snr37A (n= 2), right) showing SNR81-dependence of 25S-‘¥1052 and U2-‘¥42. c, Pseudo-seq 
snr81A (n = 2), snr43A (n = 2) and snr49A (n = 2) yeast strains. Cultures were reads in RDN58-1 (chrXII: 455466-455515) showing SNR43-dependence of 
harvested at high density (a, b) or log phase (c, d). ‘Ys dependent on the deleted —_5.8S-‘¥73. SNR43-dependent 25S-'¥960 was not consistently detected in wild 
snoRNA are indicated in red. CMC-dependent peaks of reads are indicated type owing to an overlapping CMC-independent reverse transcription stop. 
with dashed red lines. Traces are representative of indicated number of d, Pseudo-seq reads in RDN18-1 (chrXII: 457361-457610) showing SNR49- 
biological replicates. a, Pseudo-seq reads in RDN25-1 (chrXII: 452221-452270) dependence of 18S-'¥302, 18S-¥211, and 18S-‘¥120. 25S-‘¥990 was also 
showing SNR37-dependence of 25S-'¥2944. b, Pseudo-seq reads in RDN25-1 _ detected as SNR49-dependent (data not shown). 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 12 50 
— AMV +CMC 
== AMV -CMC 
10h — 
SSII1+CMC | 45 
~-  SSIIl-CMC 
2 — W Positions =| 
oO ne} 
oO fo} 
to oO. 
mod = P= 
NS g = 
s 2 2 
J 
3 = 7) 
c =] 
c Qa 
§ 2 
= oa 
are — AMV +CMC 
0.2 314 J 
ae == AMV-CMC 
ht — ssill+cmc 
re == §SIll-CMC 
0.0 i i f 4 
0.0 0.2 0.4 0.6 0.8 1.0 
Position (nt) 1-specificity 
b 12 : . r : 50 1.0 : r : r 
— _ 115-130 nt +CMC 
== 115-130 nt -CMC 
107 — 130-145 nt +CMC | 4, 
== 130-145 nt -CMC 
3 — _ W Positions « 
oO nol 
oO fo} 
= Q. 
= 5 2 
5 = a 
c a 
= 
a 115-130 nt +CMC 
0.2 Iv 
4 115-130 nt -CMC 
a a 130-145 nt +CMC 
pie 130-145 nt -CMC 
0.94@— 
0.0 0.2 0.4 0.6 0.8 1.0 
Position (nt) 1-specificity 


c 14 r r , , 50 
== 0.0M CMC 
tol — _ 0.2M CMC 
— 04MCMC Jy, 
— W Positions 
2) r= 
8 q 
oO nol 
oO fo] 
ba QD. 
no] = 2 
N g = 
: s 
=i 
6 = a 
c S 
c a 
8 2 
= 
oa = ~~ 0.0MCMC | 
7 
Pra — _ 0.2M CMC 
ii” — _ 0.4MCMC 
940 02 04 0.6 08 1.0 
Position (nt) 1-specificity 
d 12 . ; . 1 50 1.0 r ; r ; 
— Short cDNAs +CMC 
== Short cDNAs -CMC 
10 —— Long cDNAs +CMC | 4, 
- = Long cDNAs -CMC 
3 8 — YW Positions n= 
oO nol 
oO ° 
S a. 
8 3 s 
26 = 
2 5 
= 
OS), oe — Short cDNAs +CMC 
0.2 4 7: J 
2 2 f- = = Short cDNAs -CMC 
a 
1a < — Long cDNAs +CMC 
4 f- * Long cDNAs -CMC 
Va 
0) 0.0 
0.0 0.2 0.4 0.6 0.8 1.0 
Position (nt) 1-specificity 


Extended Data Figure 2 | Technical variations of Pseudo-seq give similar 
results. a-d, MetaPsi plots (left), and ROC curves (right) for various library 
prep conditions n = 1 for each condition. CMC-treated samples (solid) and 
mock-treated samples (dashed) are indicated. a, Comparison of AMV-RT 
(orange) and SuperScript III (blue) (0.2 M CMC; 115-130 nt, 100-115 nt 
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fragments respectively). b, Comparison of 115-130 nt (orange), and 130-145 nt 
(blue) RNA fragment sizes (AMV-RT; 0.2 M CMC). c, Comparison of 0.2 M 
CMC (blue), and 0.4 M CMC (orange) (AMV-RT; 115-130 nt RNA). 

d, Comparison of shorter (orange) and longer (blue) truncated reverse 
transcription fragment sizes (AMV-RT; 115-130 nt RNA; 0.2 M CMC). 
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a, b, Pseudo-seq was performed on wild-type (n = 4), snr814 (n= 2), pus14d _ are dependent on growth to high density. b, Pseudo-seq reads in U3a 

(n = 2) and pus7A (n = 2) yeast strains grown to high density. CMC dependent snoRNA (SNRI7A, chrXV: 780461-780560) showing snR17A-'¥369 
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Extended Data Figure 7 | Analysis of potential snoRNA targets. and high density samples are indicated by arrows. b, An averaged metaPsi plot 


a-d, Pseudo-seq was performed on wild-type yeast in log phase, or grown to for all randomizations. c, d, +CMC (c) and —CMC (d) MetaPsi plots for 
high density. Reads from n = 4 biological replicate libraries for each condition computationally predicted Vs separated by base pairing. Sites with 8 or more 
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Extended Data Figure 8 | Mechanisms of Pus-dependent biological replicates. Pseudo-seq reads for RPL14A (a, chrXI: 431901-432200) 


pseudouridylation. a, b, Summaries of the PUS-dependence of called ‘Ys using and PDI (d, chrIlI: 49401-48760) showing PUS1- and PUS7-dependency, 
higher stringency cut-offs (10/14 libraries) (a) and lower stringency cut-offs (9/ _ respectively. Both are dependent on growth to high density. d, e, g, h, WebLogo 
14 libraries) (b). c, f, CMC-dependent peaks of reads are indicated with dashed 3.3 was used to generate motifs for PUS1 (d), PUS2 (e), PUS7 (g) and PUS4 (h). 
red lines. Traces are representative of n = 4 (wild type), and n = 2 (pus) 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a os 
= 
oO 
c 
Oo 
= 
9? 
oO 
ea 
o) 
ade 
= 
oO 
= 
oO 
= 
Oo 
+ 3 : aa : 
wv | I | I 
RNA2885 100 bases P47 7WNYNYNlYl hf 
b 14 ——120 © 10 
— HeLa +CMC 
— HeLa -CMC 
ie ' 100 
— W Positions S08 
3 10 <a 
® soy 
3 2 $06 
o 8 oS 
N 5s 2 
= 60 7 © 
£ = 
cS) < s 0.4 
c 40a 8 
§ 4 gS 
= S 
2 0.2 
2 20 aa 
0 0 0.0 
20 a0 0 10 20 0.0 0.2 0.4 0.6 0.8 1.0 
Position (nt) as fess 
False positive rate (1-specificity) 
Extended Data Figure 9 Positive controls for human RNA Pseudo-seq. normalized reads (left axis) for +CMC libraries (orange), and —CMC 
a, Pseudo-seq reads for RDN28S5 (1516-1765) containing five known Ps libraries (blue). The number of ‘’s at each position in the metaPsi window is 
(28S-'P1536, 28S-'¥1582, 28S-‘¥1677, 28S-\¥1683 and 28S-‘¥1744). CMC- indicated (black, right axis). c, A ROC curve of the Pseudo-seq signal for all 
dependent peaks of reads are indicated with dashed red lines. Traces are known Ws in rRNA and snRNA. 


representative of n = 5 biological replicates. b, A metaPsi plot of mean 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


8.0 ; : ; + : ; : : 
a b 745, ° win both conditions (68) e 
“| @ W in + serum (29) 5 
ewWin- . 
5 7.0| ° W in - serum (4) : 
B 
+ 6.5} 
2 ; 
= 6.0} as ‘ 
oO ec” 
W + serum W - serum = 5.5 - 
> eo,? aa 
is oe 
S ee * Aes s 
oD Per oe 
245 ow & ST 
ra 23° 
4.0 
3.5 
35 40 45 50 55 60 65 7.0 75 8.0 
log10(avg RPKMs), -serum 
c RNA18S5-296 RNA18S5- 1046, 1177 
Ss = iG i 
Qa = : H ae 
c ; DE i 
Q | : - : 
opener PR ee We mms [ory |e eee a el aE» gee er, PSO pUts Saar nem nats | ae 
g : Z | 
® e ‘ 
Ty), bo 
= t 4 
oa : : 
ce Po i. 
Q - 
| 
re : : 
+ Li a | a a_i bes . L 
w ti I tol I I 
EEE BSE 
RNA28S5- 2839, 2843 RNA28S5-4532 
Qa : 
Q 
5 il ‘er iv : kn a mu: boa = i J ul 1 
« ui 
a oe i 
oO oe ‘ 
=|, i : 
= 3 ‘ ‘ ; i 
oO. a : : : 
c i : 
g 
= 
w ll 1 l oa I i 


D2d9dIIPIPIDIIIIIIIIIIIIDIIIDIDIIIIIIIPIIIDIIIIIIIIIDIDIDIIIIIIIIIIIIIIIIIIIIDIPIDIDIDIIIIPIIIIIDD 


RNA25S5 100 bases ————____ 

Extended Data Figure 10 | New pseudouridines in human RNAs. a, A Venn 
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(red) are indicated. c, Pseudo-seq reads for RDN18S5 (184-411) (top, left), 
RDN18S5 (1015-1210) (top, right), RDN28S5 (2713-3108) (bottom, left), and 
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Crystal structure of the RNA-guided immune 
surveillance Cascade complex in Escherichia coli 


Hongtu Zhao’, Gang Sheng’, Jiuyu Wang’, Min Wang’, Gabor Bunkoczi®, Weimin Gong’, Zhiyi Wei* & Yanli Wang! 


Clustered regularly interspaced short palindromic repeats (CRISPR) 
together with CRISPR-associated (Cas) proteins form the CRISPR/ 
Cas system to defend against foreign nucleic acids of bacterial and 
archaeal origin’’. In the I-E subtype CRISPR/Cas system, eleven 
subunits from five Cas proteins (CasA,B2C,D,E,) assemble along 
a CRISPR RNA (crRNA) to form the Cascade complex'®*. Here 
we report on the 3.05 A crystal structure of the 405-kilodalton 
Escherichia coli Cascade complex that provides molecular details 
beyond those available from earlier lower-resolution cryo-electron 
microscopy structures. The bound 61-nucleotide crRNA spans the 
entire 11-protein subunit-containing complex, where it interacts 
with all six CasC subunits (named CasC1-6), with its 5’ and 3’ termi- 
nal repeats anchored by CasD and CasE, respectively. The crRNA 
spacer region is positioned along a continuous groove on the con- 
cave surface generated by the aligned CasC1-6 subunits. The five 
long f-hairpins that project from individual CasC2-6 subunits extend 
across the crRNA, with each B-hairpin inserting into the gap between 
the last stacked base and its adjacent splayed counterpart, and posi- 
tioned within the groove of the preceding CasC subunit. Therefore, 
instead of continuously stacking, the crRNA spacer region is divided 
into five equal fragments, with each fragment containing five stacked 
bases flanked by one flipped-out base. Each of those crRNA spacer 
fragments interacts with CasC in a similar fashion. Furthermore, 
our structure explains why the seed sequence, with its outward- 
directed bases, has a critical role in target DNA recognition. In conclu- 
sion, our structure of the Cascade complex provides novel molecular 
details of protein-protein and protein-RNA alignments and inter- 
actions required for generation of a complex mediating RNA- 
guided immune surveillance. 

Our 3.05 A crystal structure of the E. coli Cascade complex exhibits a 
sea-horse-shaped architecture, similar to previous cryo-electron micro- 
scopy structures’®. Instead of forming a simple, compact structure, the 
11 subunits are assembled into two structural layers. Six CasC pro- 
teins, together with CasD and CasE, tightly pack to form the outer layer 
(Fig. lb-d and Extended Data Fig. 1). Within this outer arch-like con- 
figuration, the six CasC proteins labelled C1 to C6 are assembled into 
a symmetry-related helical alignment defined by a ~40° rotation, to- 
gether with a vertical shift of 25 A, per subunit. CasE and CasA cap the 
two ends of the CasC helical arrangement by associating with CasC1 
and CasC6, respectively. The inner layer, consisting of the CasA and the 
CasB dimer, is connected to the outer layer mainly via CasA—CasD inter- 
actions as well as multiple relatively weak CasC contact sites at their 
lateral regions (Fig. 1c). We observed all 61 nucleotides of the crRNA, 
with the 32-nucleotide spacer shown in red and the two flanking repeat 
segments (also called 5’- and 3’-handle) shown in orange in Fig. 1. The 
crRNA is properly positioned along the long outer layer (Extended Data 
Fig. 2). Interestingly, each subunit in the outer layer contains a long 
B-hairpin protruding from its structured core (Fig. 1d). 

The CasE endonuclease subunit at the head end of the Cascade com- 
plex adopts a two-ferredoxin fold, similar to the RNA-bound structures 
of Thermus thermophilus Cse3'*'° and Pseudomonas aeruginosa Csy4"°. 


The 3’-end of the crRNA adopts a stem loop-fold, with the RNA stem 
anchored within the positively charged cleft of the carboxy-terminal 
domain of CasE (Fig. 2a, b and Extended Data Fig. 3a). Its B6-B7 hair- 
pin, a universal structural feature of the CasE family, contains several 
positively charged residues that contact the crRNA backbone. The 3’- 
end repeat of crRNA is recognized by the side chains of R111** and 
R113", which insert into the major groove of the RNA stem and 
contact the bases of G(+10) and G(+ 19) (Extended Data Fig. 3b, c). 
With the following continuous turn, the 3’ terminal repeat surrounds the 
66-87 hairpin, a feature not observed previously, and is probably respons- 
ible for proper anchoring of the crRNA in the Cascade complex (Fig. 2b). 
At the opposite face of the crRNA stem-binding surface, a V-shaped cleft 
is formed between the two domains of CasE. There, a specific loop from 
CasCl1 residues 197-208 inserts into this V-shape cleft of CasE via 
hydrophobic interactions (Fig. 2c and Extended Data Fig. 3d). 

Once the crRNA is Cascade-bound, the 8-nucleotide repeat of its 5’- 
end takes on a hook-like shape, with a sharp turn at the G1-G(—1) and 
G(—1)-C(—2) steps, preceded by continuous stacking within the C(—3)- 
A(—5) steps, and three residues pointing into a different direction at 
each base (Fig. 2d). The 5’-handle is buried into a pocket formed by 
CasD, CasC6 and CasA ina sequence-specific manner (Fig. 2d-g). Res- 
idues C(—2), C(—3), A(—5) and U(—7) interact with CasD and CasC6 
sequence specifically. The base of G(—1) stacks onto the side-chain of 
L89@*P_ R108@*, which is located within the 65-6 hairpin, forms 
hydrogen bonds with the base of C(—2) (Fig. 2e). The CRISPR/Cas sys- 
tem cannot launch a defence against the invading plasmid once R108" 
is substituted by Ala (Extended Data Fig. 4a). F129@** and K177* 
form respective hydrogen bonds with the bases of C(—3) and A(—5), 
which are further stabilized by stacking with the side chains of F129°*** 
and R206“, respectively (Fig. 2e and f). The 5’-terminal residues 
A(—6), U(—7) and A(—8) are sandwiched by amino acid residues; that 
is, A(—6) between F208 and P19@S”, U(—7) between Y145°*? 
and the peptide plane of residues 38-39" on the «1 -helix, and A(—8) 
between Y142@°P and D179°@-_R4g@sP pair. Additionally, the 2’ 
hydroxyl and phosphate groups of the 5’-handle provide hydrogen 
bonds and ionic bonds with CasD (Fig. 2f, g and Extended Data Fig. 2). 

A specific 85-86 hairpin of CasD, which contacts CasA and CasC6, 
has a critical role in 5'-handle binding. This hairpin extends across the 
RNA backbone to CasC6 through the gap between G(— 1) and C(—2), 
in similar fashion to that of CasC2-6 (discussed below). In addition, a 
loop formed by residues 120-135**“ inserts into the cavity between 
the B-hairpin and the main body of CasD (Fig. 2h and Extended Data 
Fig. 4c), and a loop of CasC6 (residues 164-179) inserts into the oppos- 
ite groove (Extended Data Fig. 4d), indicating a role of the B-hairpin for 
proper assembly of CasA and CasCé6 into the Cascade complex. In the 
crRNA-free structure, this arm is highly flexible’’, suggesting mutual 
stabilization between CasD and bound crRNA, which is supported by 
the fact that upon replacement of the B-hairpin by a non-functional 
(GGS), linker, the Cascade complex cannot be assembled properly and 
Cascade-mediated immunity was abolished (Extended Data Fig. 4a, b, 
e, f). 
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Medical Research, Addenbrooke’s Hospital, Hills Road, Cambridge CB2 OXY, UK. “Department of Biology, South University of Science and Technology of China, Shenzhen, 518055, China. 


6 NOVEMBER 2014 | VOL 515 | NATURE | 147 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Cascade 


Figure 1 | Crystal structure of the 
Cascade complex from E. coli. 
a, The I-E subtype CRISPR system in 
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Six CasC subunits form the backbone of the Cascade complex. The 
proximal (inner side of the helix) and distal domains (outer side) of each 
CasC adopt a concave shape (Fig. 3). Except for CasCl, all CasC prox- 
imal domains contain long B-hairpins (Fig. 1d), which are flexible in the 
, and stabilized upon crRNA-Cascade assembly. 


RNA-free strucures'*”” 
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b, Overall structure of the Cascade 
complex. c, Cartoon scheme of 
Cascade showing the structural 
uniqueness in both the crRNA 
recognition and the subunit 
organization. All subunits in the 
outer layer contain a long hairpin 
protruding from the structural core 
for stapling the crRNA spacer, for 
specific binding of the stem-loop 
structure, and for the outer layer 
organization. The sophisticated 
protein interaction network in the 
Cascade organization is shown. 

d, Individual components are shown. 


The B-hairpin of each CasC extends up into the groove of its preceding 
CasC subunit, where it is locked by interactions of the B-hairpin resi- 
dues D204°*°, D205°° and L206“**° with the surrounding groove 
residues (Fig. 3a, b and Extended Data Fig. 5a—-c). Consistently, mu- 
tating D204-L206“*© to Ala abolishes Cascade-mediated immunity 


Figure 2 | The two ends of the crRNA are 
anchored. a, The 3’- repeat segment winds the 
66-67 hairpin of CasE. b, Positioning of non- 
helical residues (+1)-(+5). U(+5) is inserted into 
the cleft between two hairpins. U(+4) and G(+3) 
is inserted into the concave surface. A(-++2) makes a 
break from G(+3) and stacks with G(+1). ¢, An 
o-helix from CasCl1 inserts into a cleft of CasE. 

d, The 5’-end repeat is buried by CasD, CasC6 and 
CasA. e-g, Expanded views of the interactions 


between the Cas proteins and the 5'-repeat 
segment. h, A loop of CasA is inserted into the 
cavity of CasD. 


(Extended Data Fig. 5d). Interestingly, because CasCé6 is the last CasC 
subunit of the helical arrangement, it employs a CasD B-hairpin, even 
though CasD shares no similarity in overall folding with CasC. 

Our structural analysis revealed an intriguing feature of a kink every 
6 nucleotides within the crRNA spacer fragment. The 32-nucleotide spacer 
is embedded into a contiguous, positively charged groove formed by 
CasC1-6. The residues 1 to 5,7 to 11, 13 to 17, 19 to 23 and 25 to 29 are 
continuously stacked. Intriguingly, these stacks are interrupted by five 
distinct breaks at steps 5-6, 11-12, 17-18, 23-24 and 29-30. The 6th, 
12th, 18th, 24th and 30th residues flip out of the stacked helix, generat- 
ing five regularly spaced kinks across the spacer, thus creating a discon- 
tinuous stacking arrangement of five similar fragments, each containing 
five stacked bases and one base splayed out (Fig. 3a, b). Interestingly, 
the kink positions precisely colocalize with those regions of the five 
B-hairpins projecting from CasC2-6 that cross the narrow gap at the 
break steps. Those two features are stabilized by the side-chains of 
F200*° and V203“**“ stacking with the respective two bases at each 
break site, and by hydrogen bonds formed between the B-hairpin and 
the crRNA phosphate backbone (Fig. 3b). 

The 5-nucleotide stacked fragments between the kinks are positioned 
on the concave surface of the outer layer and are bound to their respect- 
ive CasC motifs (Fig. 3a and Extended Data Fig. 2). Importantly, the 
5-nucleotide segments are organized into an A-form helix, with the bases 
extending outwards, which is crucial for proper base pairing with target 
DNA. In contrast, the splayed out bases at the kink sites all face the dis- 
tal domain of CasC, preventing any base pairing with target DNA, and 
most likely resulting in a non-continuous crRNA-DNA target duplex. 
Consistent with our structural finding, five site mismatches at the kink 
positions have a minimal effect on the target binding affinity, whereas 
mismatches positioned immediately upstream or downstream of the 
kink sites reduce the binding affinity by 20-30-fold (Extended Data 
Fig. 5e, f). 

Residues 1-5 and 7-8, located within the first and second 5-nucleotide 
stacked segments of the spacer, are essential for target- DNA recognition” ”. 
In our structure, nucleotides 1-5, which are present on the surface of 
CasC6, are continuously stacked and extend outwards (Fig. 3c and 
Extended Data Fig. 6a). Residue 6 flips away from stack residues 1-5 
by 180°, and therefore cannot pair with the DNA target, suggesting that 
it is not crucial for target recognition and explaining why this residue 
is not part of the seed sequence. Following this kink at position 6, the 
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Figure 3 | The spacer fragment is 
positioned in the continuous 
groove on the concave surface of 
CasC1-6. a, The spacer fragment in 
red is not continuously stacked 

with five kinks present at regular 
intervals, namely at positions of the 
6th, 12th, 18th, 24th and 30th 
nucleotide. Seven important 
B-hairpin arms from CasD, CasE and 
CasC2-6 subunits are highlighted. 
b, A B-hairpin of CasC3 inserts 

into the concave surface of CasC2. 
c, The interaction between the 
crRNA seed segment and CasC5-6. 
d, e, Structural comparison between 
CasC1 (cyan) and CasC2 (grey) 

(d), and between CasC5 (grey) and 
CasCé (blue) (e). 
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base edges of residues 7-8 face in a similar direction as residues 1-5. 
Residues 7-8 continuously stack with residues 9-11, thus pairing between 
residues 7-8 and the target DNA should promote the formation of the 
crRNA-DNA target duplex, thereby facilitating the propagation to the 
3’ terminus of crRNA. Compared with the other 5-nucleotide stacked 
segments, partially obstructed by the CasB dimer, the first two segments 
containing the seed region are more accessible for the target strand. 
Moreover, the binding of proto-spacer adjacent motif (PAM)”* to CasA 
allows for target recognition to start from the 5’-terminus of the spacer 
(Extended Data Fig. 6b-g). In conclusion, the fact that the seed bases 
face outwards towards solvent, together with the specific position near 
the tail end of the Cascade complex, allow for easy pairing with the DNA 
target, reminiscent of a short interfering RNA seed segment in Argo- 
naute complexes”. 

Structural comparison of the six CasC subunits revealed that, unlike 
the long B-hairpin arms of other CasC subunits, the corresponding re- 
gion of CasC1 forms an «-helix, rotates by ~90° (Fig. 3d), and extends 
to the cleft of CasE (Fig. 2c and Extended Data Fig, 3d). F200°* ° and 
V203 stack with the nucleotides at the break site, whereas F200 
and V203“**“' form hydrophobic interactions with CasE, thereby pro- 
moting Cascade complex formation. As observed in previous cryo- 
electron microscopy structures’”°, CasC6 adopts a different conformation 
from other CasC proteins. The rotation of the distal domain enables 
CasC6 to interact with CasD forming the 5' repeat segment binding 
pocket, and the particular conformation of the 85-86 hairpin arm of 
CasD prevents CasCé from taking on the same conformation as the other 
CasC subunits (Fig. 3e and Extended Data Fig. 5b). 

The inner layer of Cascade is comprised of CasA and CasB1-2 sub- 
units. As mentioned above, one loop of CasA inserts into the concave 
surface of CasD, while the C-terminal domain of CasA contacts CasB2 
and CasC5-6 (Fig. 4 and Extended Data Fig. 7a, b). The elongated dimer 
of CasB1-2, located along the inner surface of the crRNA-CasC spine, 
is not involved in crRNA binding. Instead, it interacts with CasA, CasC 
and CasD, thus facilitating the Cascade assembly. Due to the curved 
shape of each CasB subunit, the asymmetric CasB-CasB interaction turns 
the CasB dimer into a curved structure as well, which is important for 
the interaction of CasB with three continuous CasC subunits (CasC1-3 
with CasB1 and CasC3-5 with CasB2, Figs 1c, 4 and Extended Data 
Fig. 7c-f). Binding of the inner layer to the CasC helix is mediated by 
five unique binding spots, each of which is formed by a negatively charged 
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Figure 4| Assembly of CasA and CasB to the six-CasC helix mediated by the 
unique Asp-Arg/Lys-Trp triad. The ribbon representation of the inner layer 
and the CasC helix is shown in the background. Each triad alignment is 
highlighted by red circle and the structural details are presented accordingly. 


residue D22* that bridges to a positively charged residue (R27 or R101 
of CasB1 or CasB2, as well as K474 of CasA), and is stabilized by stacking 
with the aromatic side chain of W199 of the following CasC molecule. 
Thus, the Asp-Arg/Lys-Trp triad unites two neighbouring CasC with 
either CasA or CasB (Fig. 4 and Extended Data Fig. 7). 

Upon viral invasion, the Cascade complex binds double-stranded DNA 
exhibiting sequence complementary to the crRNA spacer sequence, and 
then recruits Cas3, a nuclease-helicase that cleaves the bound target 
DNA”™”*~°°. To understand the molecular basis for target recognition, 
we fitted each subunit structure into the cryo-electron microscopy map” 
and compared the structures with target bound and unbound. Overall, 
the outer layer displays negligible intrinsic motion, except that CasE 
rotates around the CasC1 binding site by ~ 15° in an anti-clockwise di- 
rection, moving the crRNA stem-loop upward as a result (Extended Data 
Fig. 8a—d). In contrast, the inner layer undergoes a large conformational 
change upon target binding. While CasB1 and CasB2 move along the 
CasC helix for ~10-12 A, CasA rotates by ~25°, resulting in new in- 
terfaces of CasA-CasC and CasB-CasC (Extended Data Fig. 8c, e). 
Due to these conformational changes, the binding channel widens at 
the seed region, allowing the crRNA-target DNA duplex to fit in, and 
the motion of the CasB dimer stabilizing the target (Extended Data 
Fig. 8f, g). 

In conclusion, our structural study has revealed that the crRNA plays 
an essential role not only in target recognition, but also in cascade com- 
plex assembly. Furthermore, the elongated B-hairpin loop from Cas pro- 
tein represents a multifunctional element that serves a critical role in 
spacer RNA stabilization, CasC helix formation, and CasA—CasD as 
well as CasC-CasA and CasC-CasB interactions. Our structure also re- 
veals a unique RNA-binding motif, in which a specific crRNA binding 
mode is used several times to assemble a multi-component complex. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Cloning, expression and purification of the Cascade complex. The five Cas genes, 
CasA, CasB, CasC, CasD and CasE, were amplified from E. coli K12 genomic DNA 
by polymerase chain reaction (PCR) and subcloned into different expression vec- 
tors. CasA was inserted into pET22b (Novagen, Amp®). CasB was inserted into 
pRSEFDuet-1 (Novagen, Kan®) containing an N-terminal His,-tag. CasC, CasD and 
CasE were inserted into a modified pCDFDuet- 1 (Novagen, Str®). Template CRISPR 
was chemically synthesized from Sangon Biotech (sequences provided below) and 
subcloned into pACYCDuet-1 (Novagen, Chl). Generation of the CasD and CasC 
point mutants was performed by site-directed mutagenesis. All constructs were con- 
firmed by sequencing. All four vectors were then co-transformed into the E. coli 
BL21 (DE3). Overexpression of Cascade was induced with 0.2 mM isopropyl-B-p- 
thiogalactoside (IPTG) at an OD¢00 nm of 0.4. After growing for 20 h at 16 °C, cells 
were harvested by centrifugation, homogenized in 1X PBS buffer A (pH7.3) 
containing 5% glycerol. After sonication and centrifugation, supernatants were 
incubated with Ni Sepharose resin (GE Healthcare) for 2h and eluted with an 
increasing imidazole gradient in buffer A. Then, the recombinant complex was fur- 
ther purified by Heparin HP column (GE Healthcare) and MonoQ 5/50 (GE Health- 
care) with buffer B (20 mM HEPES, pH 7.5, 100 mM NaCl) and buffer C (20 mM 
HEPES, pH 7.5, 1 M NaCl). For crystallization trials, the protein complex was pu- 
rified by gel filtration chromatography (Superdex 200 10/300 GL, GE Healthcare) 
in buffer D (20 mM HEPES, pH 7.5, 10 mM DTT, 1 mM EDTA). The peak fractions 
were collected and concentrated to about 4 mg ml” for crystallization. The entire 
purification procedure was performed at 4 °C and purity of the Cascade complex 
was verified by SDS-PAGE. 

Crystallization and data collection of Cascade complex. Crystals of the Cascade 
complex were grown at 16 °C using the hanging drop vapour diffusion method with 
a reservoir solution containing 100 mM Tris-HCl, pH 8.5, 200 mM NaAc, 100 mM 
Glycine, 50 mM LiAc and 11-13% (w/v) polyethylene glycol (PEG) 4,000. Crystals 
appeared after 4 days and grew to full size within 2 weeks. For data collection, crys- 
tals were transferred into mother liquor supplemented with 20% (v/v) glycerol and 
flash-cooled in liquid nitrogen. Diffraction data were measured and collected at 100 K 
with wavelength of 1.1 A at beamline BL1A, BL5A and BLI17A of Photon Factory 
(Japan, KEK) or at beamline BL17U of Shanghai Synchrotron Radiation Facility (SSRF). 
All data sets were processed using HKL2000 or iMosflm*'~*. The crystals were iden- 
tified as belonging to space group P1, with two Cascade complexes per asymmetric 
unit. 

Structure determination and refinement of the Cascade complex. By using the 
8.8-A electron microscopy map of the target-unbound Cascade complex asa search 
model, we found an optimal solution for molecular replacement in PHASER**™. 
To improve the phase the non-crystallographic symmetries (NCS) for two Cascade 
complexes and for five CasC molecules were identified and applied in density mod- 
ification process using RESOLVE”. The phase was dramatically improved with the 
powerful NCS average and then was gradually extended to 3.5 A (Extended Data 
Fig. 9). The structural model was manually built and refined in Phenix” against the 
3.05 A data set. COOT”” was used for model building and adjusting. The final model 
was validated using MolProbity** and Procheck”’. The refinement statistics are listed 
in Extended Data Table 1. All structure figures were prepared using PyMOL (http:// 
www.pymol.org/). Individual Cas proteins are colour-coded. The repeat seg- 
ments of crRNA are in orange and the spacer is in red. 

Cascade mutagenesis and gel filtration assay. Two types of Cascade mutants, 
CasCA and CasDA, were produced. In CasCA, residues Phe 200, Asp 204, Asp 205 
and Leu 206 of CasC were mutated to Ala. In CasDA, residues 75-104 of CasD were 
replaced by a (GGS)z linker. CasCA and CasDA were cloned and expressed using 
the same strategy as used for wild-type Cascade. Both mutants were purified using 
Ni-affinity resin and analysed by gel filtration assay (Superdex200 10/300 GL, GE 
Healthcare). The assays were performed in a buffer containing 20 mM HEPES 
pH7.5 and 100 mM NaCl. The relevant fractions were then analysed by SDS-PAGE 
and visualized by Coomassie blue staining. 

pACYC-target invasion assay. For our invasion assay, we used a CRISPR system 
containing a 32-nucleotide spacer: 5‘-GGCTCCCTGTCGGTTGTAATTGATAA 
TGTTGA-3’. The complementary sequence of the spacer (5’'-TCAACATTATC 
AATTACAACCGACAGGGAGCC-3’) followed by a PAM sequence (ATG) was 
constructed into pACYC-1 between the restriction sites NdeI and Xhol, yielding 
plasmids that contain an artificial protospacer target (pACYC-target). E. coli BL21 
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(DE3) cells that contain CRISPR-encoding plasmids with either the wild-type or 
mutant Cascade genes together with Cas3 were inoculated in LB containing kana- 
mycin (50 pg ml~ Ay), ampicillin (100 jug ml 1) and streptomycin (50 pg ml~?), and 
grown to an OD¢00 nm of 0.3. Expression of both Cas genes and CRISPR was induced 
for 45 min with 0.5 mM IPTG. Cells were then collected at 4°C and made com- 
petent according to standard protocols. Transformation was performed by adding 
either 60 ng pACYC-target or 60 ng pACYC. 

Next, cells were grown in LB for 60 min at 37 °C before plating on LB-agar plates 
containing 0.5 mM IPTG, ampicillin (100 pg ml’), kanamycin (50 pig ml), strep- 
tomycin (50 pg ml *) and chloramphenicol (34 1g ml‘) . Plates were incubated 
for 12h at 37 °C before observation. 

The pACYC control without target sequence did not trigger CRISPR/Cas inter- 
ference, and was therefore retained, with no inhibition of growth observed. In con- 
trast, the plasmid pACYC-target containing target sequence was recognized and 
degraded by CRISPR/Cas system, growth inhibition observed on selection medium. 
Plasmid loss assay. E. coli BL21 (DE3) strains expressing either wild-type Cascade 
(WT) or Cascade complex in which the hairpin-arm of CasD was replaced with a 
(GGS), linker (DA), were transformed with pACYC-target plasmid. Cultures were 
inoculated in LB containing kanamycin (50 pg ml '), ampicillin (100 1g ml‘), strep- 
tomycin (50 wg ml 1) and chloramphenicol (34 ug ml~ 1), and grown to an OD¢00 nm 
of 0.6. Expression of cas genes and CRISPR was induced for 5 h with 0.5 mM IPTG. 
Cells were serially diluted and plated on LB-agar non-selective plates (containing 
50 ug ml~! kanamycin, 100 pg ml~! ampicillin and 50 pg ml“! streptomycin), or 
selective plates containing additional 34 jg ml ' chloramphenicol. All plates were 
incubated for 12 h at 37 °C before observation. If the plasmid can trigger CRISPR/ 
Cas interference, it will be recognized and degraded. As a consequence, growth will 
be inhibited on selection medium. Alternatively, the plasmid will remain intact, 
and colonies will appear on selection medium. 

Isothermal titration calorimetry (ITC). DNA oligonucleotides for ITC were dis- 
solved in buffer containing 20 mM HEPES, pH 7.5, and 100 mM NaCl. The Cascade 
complex for ITC measurements was purified by Superdex 200 with HEPES buffer 
(20 mM, pH 7.5) containing 100 mM NaCl. The purified Cascade complex (1 1M) 
was loaded into the cell, and the ssDNA solution (10 |1M) in the syringe. Experi- 
ments were carried out at 20 °C using a MicroCal ITC200 calorimeter (GE Health- 
care). The titration data were processed and fitted using Origin 7.0 software. 
Template CRISPR sequence (from 5’ to 3’). GAGTTCCCCGCGCCAGCGGGG 
ATAAACCGGGCTCCCTGTCGGTTGTAATTGATAATGTTGAGAGTTCCC 
CGCGCCAGCGGGGATAAACCGGGCTCCCTGTCGGTTGTAATTGATAAT 
GTTGAGAGTTCCCCGCGCCAGCGGGGATAAACCGGGCTCCCTGTCGG 
TTGTAATTGATAATGTTGAGAGTTCCCCGCGCCAGCGGGGATAAACCG 
GGCTCCCTGTCGGTTGTAATTGATAATGTTGAGAGTTCCCCGCGCCAG 
CGGGGATAAACCGGGCTCCCTGTCGGTTGTAATTGATAATGTTGAGAG 
TTCCCCGCGCCAGCGGGGATAAACCGGGCTCCCTGTCGGTTGTAATTG 
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Extended Data Figure 1 | Overall surface view of the Cascade complex with the same orientations as those used for Fig. 1b. 
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Extended Data Figure 2 | Schematic summary of Cas-protein-crRNA interactions. Hydrogen bonds and salt bridges are indicated by dashed lines. Cation-1 
interactions are indicated by wavy lines. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 3 | Structure of CasE subunit in the Cascade complex. — sequence-specific interaction between CasE and the major groove of 3'-end 
a, CasE is shown as a surface representation, and is labelled according to repeat (b), and the interaction between CasE and 3’-end repeat backbone (c). 
electrostatic potential (red, negative charge; blue, positive charge), and RNA is _ d, Expanded view of the interaction between CasE and CasCl. 

shown in ribbon representation (orange). b, c, Magnified view of the 
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Extended Data Figure 4 | The f-hairpin arm of CasD is critical for Cascade 
function. a, pACYC-target invasion assay. Competent E. coli BL21 (DE3) cells 
expressing either wild type Cascade (WT) or a mutant form in which the 
hairpin-arm was replaced with a (GGS), linker CasD mutant (DA), or R108A, 
R108A/Y142A CasD mutants, were transformed with either pACYC plasmid 
(Control) or pACYC-target plasmids (Target). Colony-forming units per 
microgram pACYC (CFU per jug) are depicted for each of the strains. The 
statistics was based on 10 replicates. Error bars represent the standard error 
of the mean (s.e.m.). Upon transformation with pACYC plasmid control, 

E. coli cells expressing WT Cascade exhibited much higher transformation 
efficiencies than cells transformed with pACYC-target plasmid. However, the 
transformation efficiencies were high in E. coli cells expressing mutant CasD 
and transformed with either pACYC control or pACYC-target. These results 
show that the hairpin-arm of CasD is essential for proper function of the 
Cascade complex. b, Plasmid loss assay. E. coli BL21 (DE3) cells expressing 
wild-type or casDA(75-104)-replaced Cascade complex and pACYC-target, 
were serially diluted and grown on either non-selective or selective plates upon 
induction with 0.5 mM IPTG. Growth of BL21 cells expressing WT Cascade 
was seriously inhibited on selection medium compared with growth on non- 
selection medium. However, BL21 cells with mutant CasD displayed similar 


growth on selection and non-selection medium, indicating that the target 
sequence was retained upon introduction of the CasD mutant. c, Interface 
between CasD (green) and CasA (salmon). d, Interface between CasD (green) 
and CasCé (blue). e, Pull-down and gel-filtration analysis of the Cascade 
assembly. WT Cascade complex was formed effectively in the Ni-NTA column 
and was eluted out with elution buffer containing 100 mM imidazole. In 
contrast, very limited amounts of the co-expressed Cascade complex with the 
DA mutant or CA mutant were obtained in the 100 mM imidazole elution, 
with the majority of the Cas proteins were in the flow-through. The fractions 
eluted with 100 mM imidazole were collected and applied for the analytical gel 
filtration assay. Compared with the WT Cascade, both DA mutant and CA 
mutant Cascade did not form a stable complex in gel filtration, further 
confirming the critical role of the B-hairpin arm of CasD and CasC for the 
assembly of the Cascade complex. The CasC mutant (CA) contains F200A 
mutation as well as replacement mutation, where residues D204 to L206 were 
replaced by Ala. f, Structural comparison among E. coli CasD (green) and 
Cas5d in the RNA-free state of Streptococcus pyogenes (wheat, PDB id: 3VZH) 
and Bacillus halodurans (grey, PDB id: 4F3M). The B-hairpin arm of CasD is 
highlighted by a red dashed circle. 
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Extended Data Figure 5 | The unique multi-kinked conformation within 
the crRNA spacer region. a, The six CasC subunits form a right-handed helix, 
with a groove for crRNA binding. Except for CasC1, the B-hairpin (highlighted 
by a black arrow) of each CasC molecule extends up into the groove of its 
preceding CasC, and thus results in five kinks observed for the crRNA spacer. 
b, Superposition of CasC5 (cyan) and CasCé (blue). The top of long B-hairpin 
in CasD is shown as green spheres, depicting the clash between the distal 
domain of CasCé and the long B-hairpin top given it adopts the same 
conformation as CasC5. c, CasC (cyan) shares a similar overall fold with 
Sulfolobus solfataricus Cas7 (grey, PDB id 3PS0). d, The pACYC-target 
invasion assay showing that mutating three interacting residues (D204, 
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D205@"°, L206*°) at the B-hairpin arm largely impairs Cascade activity. 

e, Four designed DNA targets with complementary sequence (WT) to the 
spacer segment of crRNA or with non-complementary mutations at kinked 
sites (M1), at immediately upstream nucleotides (M2), and at immediately 
downstream nucleotides (M3). The five kinked sites are highlighted as yellow 
background, with the mutated sequences in purple. f, ITC-based analysis of 
the Cascade complex and four DNA targets interactions. ITC analysis showed 
that the mutations of the flipping of these residues had no obvious effect on 
target recognition. However, mutations in either the first or the last nucleotides 
in the 5-nucleotide stacked region largely affected target binding. 
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Extended Data Figure 6 | Structural analysis of the crRNA spacer segments. _ identical. Like those of CasC2-6, the B-hairpin arm of CasD is involved in 


a, The first 5-nucleotide segment in the spacer region is positioned in the maintaining the kink at G(— 1). Interestingly, although the overall foldings are 
positively charged groove of CasC. b, Overlap of the five 5-nucleotide segments _ distinct between CasD and CasC, their B-hairpin arms share similar 
within the crRNA space region and their interacting Cas proteins. The conformations, which are highlighted by a black box. c-g, Structural 


5-nucleotide segments are interrupted by single flipped-out bases, which are comparison of the five 5-nucleotide stacked segments with Cas proteins shown 
indicated by a black arrow, thereby resulting in kinks at these positions. The _ in the same orientation. The Cascade complex is colour-coded by chains as 
conformations of the five segments together with the kink sites are essentially shown in Fig. 1. 
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Extended Data Figure 7 | Inter-subunit interactions required for Cascade CasA-CasC (b), CasA—CasB2 (c), CasB1-CasB2 (d), CasB1-CasC (e) and 


assembly. a, Ribbon representation of the organization of CasA and CasB CasB2-CasC (f) interactions. The five triads, which connect CasA and the CasB 
dimers and their associations with the CasC helix. The inter-subunit interfaces dimer to the CasC helix, are highlighted with red circles. Hydrogen bonds 
are highlighted with dashed-line boxes. b-f, The structural details of the and salt bridges are indicated by dashed lines. 
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Extended Data Figure 8 | Structural comparison between target-bound and 
unbound Cascade. a, The building of the structural model of the target-bound 
Cascade. The subunits were individually docked into the 9A-electron 
microscopy map. The fitted model is shown in two views with the electron 
microscopy density overlapping (contoured at 20). b, Ribbon representation 
of the target-bound Cascade. c, Subunit motions. The movement was 
represented by red-green lines, which were drawn by connecting each Ca atom 
(green) in the target-bound Cascade to corresponding Cx atom (red) in the 
unbound Cascade after the two Cascade structures were superimposed. 

Thus, the lengths of the lines correlate with the motion scale. The motions of the 
outer and inner layers are shown by superimposing the target-bound with 
unbound Cascade structure in d and e, respectively. The target-free structures 
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are rendered semi-transparent. The translational and rotational motions are 
indicated by arrows showing the motional direction with translation distances 
and rotation angles labelled, respectively. f, g, Comparison of electrostatic 
surface potentials for the CasB dimer in the target-free (f) and target-bound 
Cascade (g) with cartoon representations of the crRNA spacer and its paring 
target (green). A highly positively charged groove of the CasB dimer fits 

well with the negatively charged target backbone in g, but not in f. Also, the 
paring target partially clashes with the CasB dimer in f, with the clashing 
site highlighted by a yellow circle. The structural analysis suggests that the 
motion of the CasB dimer identified in the target-bound Cascade complex is 
required for proper target binding. 
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Extended Data Figure 9 | Electron density maps showing the model quality. _ in Fig. 1. b, The 2F, — F, map contoured at 1¢ of the whole crRNA is shown 
a, The 3.5 A-electron density map contoured at 1.50 generated from the with the refined RNA model superimposed. Zoom-in views of the 5'-end, 
phase after density modification treatment with the final structural model 3'-end and the 5-nucleotide stacked segment are shown in the lower left, 
superimposed. Co backbones are shown in with the same colour-codingas used _ lower right and upper insets, respectively. 
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Extended Data Table 1 | Statistics of data collection and refinement 


Cascade 

Data collection 
Space group Pl 
Cell dimensions 

a, b, ce (A) 111.41, 118.14, 225.87 

A, B, ¥ C) 92.24, 93.55, 106.06 
Wavelength (A) 1.1 
Resolution (A) 50-3.05 (3.16-3.05) 
Rmmerge (%) 9.1 (40.5) 
I/sigma 11.7 (2.1) 
Completeness (%) 98.8 (98.3) 
Redundancy 3.05 (2.9) 
Refinement 
Resolution (A) 45-3.05 
No. reflections 195,250 
Rwork / Reree (%) 16.4/20.7 
No. atoms 

Protein 53257 

RNA 2596 
Average B (A’) 

Protein 43.4 

RNA 45.6 
R.m.s.deviations 

Bonds (A) 0.006 

Angle (°) 1.2 
Ramachandran statistics (%) 

Most favorable 89.8 

Additionally allowed 9.8 

Generously allowed 0.3 

Disallowed 0.1 


Values in parentheses are for the highest resolution shell. 
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INTERACTIVE NOTEBOOKS: 
SHARING THE CODE 


The free [Python notebook makes data analysis 
easier to record, understand and reproduce. 


BY HELEN SHEN 


Brown is taking a deep dive into his stu- 
dents’ research code. The long journey 
from Michigan State University in East Lan- 
sing to a conference in Melbourne, Australia, 
provides the perfect chance for the bioinforma- 
tician to scrutinize his lab’s new algorithm for 
removing errors from RNA sequencing data. 
Three years ago, Brown might have waited 
until he was back in his office. It is difficult to dig 
into other researchers’ code without them being 
present to explain it, make changes and produce 
updated results. But these days, Brown can work 
with his lab from afar using a free, open-source 
software package called IPython, which helps 


Fis high above the Pacific Ocean, Titus 


researchers to keep a detailed lab notebook for 
their computational work. 

Brown’s students write explanatory text and 
intersperse it with raw code and the charts and 
figures that their algorithms generate. Sitting in 
the aeroplane with an IPython notebook down- 
loaded to his laptop, Brown can interact with the 
work. He tweaks and re-runs the code, which 
executes directly in the document he is read- 
ing — allowing him to see instantly whether 
his changes are improving the algorithm. “Ican 
go through their notebook, understand exactly 
what they did and modify it, explore different 
parameters and lookat different views; he says. 
“T can do this from anywhere in the world” 

Designed to make data analysis easier to share 
and reproduce, the [Python notebook is being 


used increasingly by scientists who want to keep 
detailed records of their work, devise teaching 
modules and collaborate with others. Some 
researchers are even publishing the notebooks 
to back up their research papers — and Brown, 
among others, is pushing to use the program 
as a new form of interactive science publishing. 


BETTER BOOKKEEPING 

The IPython notebook was developed in 2011 
bya team of researchers led by Fernando Pérez, 
a data scientist at the University of California, 
Berkeley, and computational physicist Brian 
Granger at California Polytechnic State Univer- 
sity in San Luis Obispo. “We built it by solving 
problems that we ourselves had as researchers 
and educators,’ says Pérez. 
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PROGRAMMING 


[Python for beginners 


Getting the tools 

An interactive Nature demonstration of an 
IPython notebook is available at go.nature. 
com/ohkiks. The software for the notebook 
can be downloaded from the IPython 
website at go.nature.com/ma8nip. 


Learning the ropes 

Instructions for proficient coders can be 
found at go.nature.com/sdbolb; example 
notebooks are at go.nature.com/awtkxn. 

For people unfamiliar with Python, a host 
of resources exist online. OpenTechSchool’s 
educational unit on data analysis with 
Python (go.nature.com/gpuyxp) includes an 
introduction to the IPython notebook. 


>» Pérez and Granger saw that data scientists 
found it hard to share detailed but understand- 
able descriptions of their raw code that would 
allow others to build on their research. That is 
partly because many scientists in computation- 
intensive fields write code in an iterative and 
piecemeal fashion as each analysis reveals new 
insight and spins off multiple lines of inquiry. 
Keeping track of the different versions of code 
that produce various figures, and linking those 
files with explanatory notes, is a headache. 
And what gets published is usually not detailed 
enough for the reader to follow up on. “In my 
own computational physics work,’ says Granger, 
“a high-level description of the algorithm that 
goes into the paper is light years away from 
the details that are written in the code. With- 
out those details, there is no way that someone 
could reproduce it in a reasonable time scale” 

The [Python notebook addresses both issues 
by helping scientists to keep track of their work, 
and by making it easy to share and for others 
to explore the code. The T in [Python refers to 
an ‘interactive command window that helps 
users to run code, access variables, call up data 
analysis packages and view plots, while the 
Python refers to the popular programming 
language that the notebook is based on. (Pérez, 
Granger and their colleagues are now moving 
the notebook into a project called Jupyter, which 
aims to make [Python more compatible with 
other languages, including Julia and R). 


CODE CHOPS 

At the University of Texas at Austin, Tal Yarkoni 
uses the [Python notebook to run automated 
meta-analyses of brain imaging studies to 
uncover patterns of neural activity involved in 
language processing, emotion and other pro- 
cesses. The psychoinformatician plans to pub- 
lish the notebooks as companions to his future 
journal articles. “The more complicated the 
analyses, the greater the benefits of being able to 


For hands-on training, Software 
Carpentry, a volunteer organization that 
teaches basic software skills, can help. The 
group will run two-day workshops on the 
IPython notebook by invitation throughout 
the world (go.nature.com/fj6sza). 


Sharing the goods 

Python notebooks can be shared in online 
repositories such as GitHub, or over e-mail. 
Recipients need |Python software to view 
and edit the notebooks. Notebook authors 
can also use a program called nbviewer to 
create an online version of their notebook 
that will be viewable, but not editable (go. 
nature.com/ry6g4)). 


convey all that in one simple document,’ he says. 

Applications similar to the [Python note- 
book already exist for various programming 
languages. Mathematica and Maple — commer- 
cial analysis packages popular among mathema- 
ticians — include notebooks or notebook-like 
programs. MATLAB, a commercial package 
used heavily in signal processing, engineering 
and medical-imaging research, also supports a 
notebook application. Each of these notebooks 
is specialized for its corresponding proprietary 
programming language. 

A number of notebooks and notebook-like 
programs exist in the open-source world; knitr 
works with the R coding language, which is 
especially powerful for statistical analysis. And 
the Sage mathematical software system, which 
is also based on the Python language, supports 
its own notebook. Dexy is a notebook-like pro- 
gram that focuses on helping users to generate 
papers and presentations that incorporate prose, 
code, figures and other media. 

But the [Python notebook has become one 
of the most widely adopted programs of its 
kind, says Ana Nelson, the creator of Dexy. “So 
many people have heard of it who haven't heard 
of any other tool,” she says. Granger and Pérez 
do not know how many people have tried their 
software, but say that traffic to the website sug- 
gests that roughly 500,000-1.5 million people 
actively use the program. Nelson says that it is 
the best-designed of the digital notebooks, and 
attracts many users because it is free and open 
source. The application also benefits from the 
popularity of the Python language, which boasts 
a robust scientific community that meets for an 
annual international conference, and is (rela- 
tively) easy for novice 
programmers to learn. 

Although a growing 
number of researchers 
are publishing their note- 
books alongside papers 


NATURE.COM 
For more on scientifc 
software, apps and 
online tools, visit: 
nature.com/toolhox 
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(see go.nature.com/mgonbm for examples), it 
may still be some time before journals accept 
the documents as full journal articles. 

A handful of IPython notebooks have been 
published as books, and many professors use the 
program to make interactive curricula. But so 
far, the notebooks seem to have been published 
only as addenda to papers — often to provide 
analysis code and additional explanation in 
method sections. 

“Publishers, I would say, still aren't convinced 
that they want to go the whole way,” says 
Granger. The data format may be too new, he 
says, for journals to recognize the notebook as 
an official document format, such as html or 
pdf. But the [Python team has begun talks with 
a few publications. 


START FROM SCRATCH 

Most IPython notebook users are skilled 
programmers, but experts are helping to intro- 
duce beginners to coding through the software 
(see Python for beginners’). Yan Song, a post- 
doc at the University of California, San Diego, 
had no programming experience until about 
three months ago. She works on the ‘wet’ side 
ofa cellular and molecular medicine lab, where 
she designs experiments and collects data that 
computational scientists — on the ‘dry’ side — 
help her to mine for information. 

Song looks for changes in RNA expression in 
mouse and human stem cells as they differen- 
tiate into various types of neuron. In the past, 
she used Excel to compare expression patterns 
between groups of cells at different develop- 
mental stages. But earlier this year, she began to 
examine RNA sequencing data from single cells 
and her data sets exploded in size and complex- 
ity. Instead of analysing a few groups of cells, she 
had to compare hundreds of cells at once, and 
in each one she examined around 1,500 separate 
genes related to neural development. 

Olga Botvinnik, a bioinformatics graduate 
student in that lab, started to generate the results 
in an [Python notebook, so Song began play- 
ing with the analysis code — out of a mixture of 
curiosity and impatience, she says. “It seemed to 
be an easy interface. You can code one line and 
you can see whether it works right away.’ 

Within a few weeks, Song had picked up some 
basic [Python programming skills, finding 
support through online tutorials and message- 
boards. Botvinnik has also written some cus- 
tomized menus and widgets to let Song explore 
her data using different clustering algorithms. 

Song still relies on Botvinnik for help with 
intensive computational analyses, but says that 
she is now starting to explore the data on her 
own, using her biological knowledge to exam- 
ine particular subsets of cells or genes, which 
she can suggest to Botvinnikas leads for further 
analysis. “We used to speak two different lan- 
guages. I would talk about the biology and she 
would talk about coding. Now we have common 
ground; we can communicate to each other 
better. This accelerates our research,” she says. m 
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COLUMN 
Ittakes time anda 
team to win grants 


Start and finish early, seek feedback and file before deadline, 
says Ingrid Eisenstadter. 


of grants for a small family foundation 

that supports scientific research, I have 
reviewed a few thousand grant propos- 
als. All our applicants are people who were 
bright enough to get PhDs and MDs, but 
the proposals we receive tend to share the 
same flaws, whether they come from recent 


I: the 25 years that I have been director 


graduates or from researchers with years of 
experience. 

Applicants often submit proposals in which 
the research protocol is insufficiently planned 
or explained. The language is sometimes too 
technical for reviewers who do not specialize 
in that discipline. The proposal text can be 
wordy or fails to convey the study’s novelty 


or urgency. These problems inevitably result 
from applicants’ failure to allow themselves 
enough time to write the proposal and to cir- 
culate it to colleagues, advisers and depart- 
ment heads for feedback. This pattern is 
repeated twice a year, every year, when our 
submission deadlines approach. 

Iam also amazed anew each time to find 
that most of our grant-seekers wait until five 
minutes to midnight to meet our published 
deadlines. Yes, we circulate and read last- 
minute applications, but we have less time 
to ask for clarification or extra information 
in this flood tide because the clock is ticking 
for our next board meeting. And founda- 
tions always get more good proposals than 
they can fund. 


ALLOW ENOUGH TIME 

Scientists cannot plan their protocols for 
hypotheses, goals, controls, methodologies 
and analyses and then write, edit, proofread, 
copy-edit, chart, graph and lay out their 
work effectively and error-free without input 
from colleagues. If your institution does not 
have an internal review process, then you 
are already at a disadvantage in the heated 
competition for funding and should take the 
initiative and ask your co-workers to critique 
your efforts. This means finishing your draft 
well in advance of submission dates. You 
need to give yourself enough time to polish 
your proposal — and to get useful, meaning- 
ful input on it. Two months ahead may not 
be too early. 

Scientists are not trained as writers, and 
their applications would often benefit from 
editing. Although the proposals we receive 
do not usually contain vocabulary or gram- 
matical errors, they are frequently repeti- 
tive. Often, the very point of the research is 
deeply buried in the proposal and does not 
emerge until well after a lengthy discussion 
of the background, when it should appear in 
a brief introduction or a summary at the top 
of the document. I also find with surprising 
frequency that important information — the 
current population of an endangered spe- 
cies, for example, or why a species should 
be studied at all — is missing, either because 
applicants think “everyone knows that’, or 
because details are lost when the focus is 
on the big picture. Do not make this mis- 
take — it results from being too close to your 
own work to read it objectively, and you can 
avoid it by seeking comment and by schedul- 
ing enough time into the process tolet the > 
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> proposal rest for a week or two and then 
rereading it with a fresh eye. 

For example, we once received a request 
for funds to study an endangered primate. Yet 
the applicant did not mention until halfway 
through the proposal that fewer than 300 of 
these animals had not yet been wiped out by 
Homo sapiens. | called the applicant and sug- 
gested that they add that number to the pro- 
posal’s title before I circulated it. The person 
said, “Oh! Right!”, laughed, quickly resubmit- 
ted — and was funded. Had that single men- 
tion of this crucial number been missing 
altogether, there is no knowing what the result 
would have been, especially if it had been sub- 
mitted during our biannual flood tide. 


DON’T FORGET THE DETAILS 
Inadequately planned or poorly explained 
research are other common problems. 
These, too, can be corrected with input from 
a neighbouring bench or two. For instance, 
we recently heard from a scientist who put 
considerable effort into a proposal to study 
the effects of forest fragmentation on a natu- 
rally occurring hybrid tree. The regions to be 
compared included a swamp that had been 
drained to become a farm in colonial times; 
an artificially created urban park; a new sub- 
urban park; and others — all without any 
explanation for these site selections or, for 
that matter, of the broader significance of this 
choice of the tree for study. The application 
cited no references. It was not funded. 

Another time, we received a proposal for a 
genetic study that clearly needed to include 
epigenetics, considering the speed with 
which the change under investiga- 
tion had taken place. 

These proposals, as written, 
should never have left their 
home institutions. Had they 
been reviewed by the applicants’ 
colleagues, perhaps these basic prob- 
lems would have been spotted in 
time for the applicants to recast 
their protocols. 

Almost all the grant appli- 
cations that we receive seek 
our maximum funding level, 
or an amount very close to it. In 
addition to coverage for supplies, 
lab fees, travel and other outlays, 
most applicants want salary sup- 
port — whether or not they are 
obliged to raise their own sal- 
aries — and a contribution 
to the overheads of their 
institutions. I have spo- 
ken to many employ- 
ees at other small- to 
medium-sized founda- 
tions who say the same 
thing. ‘Shooting for the 
Moon does not enhance 
your chances of funding. 


s 


Many foundations’ websites provide a 
history of the grants that were funded, and 
you should use those figures to guide your 
budget decisions. In the United States, some 
foundations’ websites include their annual 
federal tax forms (called Form 990), which list 
all the grants awarded each year. You should 
research this information well in advance of 
writing a grant, and tailor your application to 
the standards of the foundation to which you 
are applying. Include a budget justification 
that explains each expense, so that the foun- 
dation knows what it is paying for. There is 
no procedure that bars bargain-hunters from 
serving on foundation boards, and strong pro- 
posals that seek less rather than more may be 
more favourably viewed. 

About 20 years ago, we got a request for 
US$700 — which had never happened 
before and has not happened since — from 
a researcher who wanted fieldwork support 
to study the threatened blue copper butterfly 
(Lycaena heteronea), a beauty of the western 
United States. We awarded the grant with- 
out hesitation. A year later, our lepidopterist 
reported that his butterfly was not as threat- 
ened as he feared. Our board of directors was 
as delighted as he was. 

It is a mistake to assume that all the grant 
reviewers at non-government funding 
organizations who will ultimately vote on 
your proposal are scientists. Unless you are 
writing about particle entanglement, use 
plain and non-technical language when- 
ever possible. If you do not, or if your topic 
makes that impossible, your proposal may 
well go to referees, leaving grant decision- 
makers to depend on someone else’s opin- 

ion. If you start the application process 

with a letter of inquiry — a brief memo 

that discusses your work — 

you should already have 

planned your research and 

should know how much 

you need in funding. Con- 

sider including the amount 

of your grant request in the 

letter, because if you later sub- 

mit a full proposal that asks for 

much more than what the board 

had expected, your chances of 
funding are diminished. 

Funders do understand that 
letters of inquiry are sometimes 

vague about research plans 
because investigators are 

seeking expressions of 
interest before taking 
the time to prepare a 
detailed protocol and 
full proposal. None- 
theless, early-career 
researchers as well as 
senior scientists should 
realize that it is difficult 
for vague letters of inquiry 
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to compete with those that make it clear that 
there is a complete research plan behind them. 
Subheads are an important navigation tool 
for proposal evaluators. Use them to high- 
light the importance and novelty of your 
work, and be clear; for example, write “This 
species is now endangered’, rather than “This 
species is now on the IUCN Red List. Even 
when you are following a specific question- 
and-answer formula 


“Unless you’re created by the grant- 
writing about giver, consider add- 
particle ing subheads to 
entanglement, emphasize your 
use plain and proposal’s strengths 
non-technical and urgency. Some 
language grant-givers have 
whenever such a strict set of 


questions that there 
is little opportunity 
to explain the goal or necessity of your work. 
If so, add an ‘introductior subhead to bring 
out these points, and if you keep the accom- 
panying text to a few sentences that enable 
you to address the issue missing from the 
one-size-fits-all questionnaire, you may not 
get into trouble. 


possible.” 


ILLUSTRATE WELL 

Similarly, photos, charts and graphs should 
highlight and emphasize the importance and 
significance of your work. Now that technol- 
ogy has facilitated the use of photos in grant 
proposals, we are seeing them more often. If 
you plan to use them, remember that they 
should be informational, not decorative. You 
also need to remember that evaluators will 
look at photos, charts, graphs and their cap- 
tions before they read the text on that page, 
so captions should underscore the signifi- 
cance of the work. 

It is also important to explain the future 
ramifications of your research after you 
complete the current phase for which you are 
seeking funding. That information is often 
missing. If your research will facilitate oth- 
ers’ investigations, or will continue in some 
other way to ripple in the water, then say so, 
whether your proposed research programme 
is basic or applied. Do not leave the evaluators 
of your proposal to have to figure this out. 

Most of the researchers and institutions 
that we have funded end all communication 
with us when we get their final reports. But 
every now and again, wise researchers send 
us copies of their publications as the years 
pass, along with a note that explains the 
relevance of the studies to the earlier work 
that we funded. This practice boosts your 
chances of success should you ever want to 
seek funding again. It is also the courteous 
thing to do. m 


Ingrid Eisenstadter is director of grants 
for The Eppley Foundation for Research in 
New York. 
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BOUNDARY WATERS 


BY MARISSA LINGEN 


he had not expected the howls. 
All babies cried, she knew — or at 


least she thought she knew. Somehow 
it was different when it was her own, and 
all her theories about calmly going about 
her business went out the window. But the 
modifications for gills were supposed to 
make time in the water a joy, a delight — she 
had pictured a cooing, fascinated baby, not 
this shrieking red monster. 

She had pictured almost a modern selkie 
baby, large-eyed and at peace in the waves, 
wise beyond her days, able to cope with the 
modern world of rising tides and tempera- 
tures. 

And that baby did come, eventually 
— after 20 minutes or so in the waves, 
settling into breathing in the water and 
then wailing like the torments of the 
damned again when they took her out. 

“I wish shed make up her mind,” she 
said, joggling the baby in the eternal 
parental soothing dance, still half- 
towelled and tangle-haired herself. 

“Maybe we should just... leave the 
swimming alone for awhile; he said. “If 
she doesn't like it” 

They took the baby to the doctor, and 
also to the genetic-modification counsel- 
lor, to see if something was wrong with the 
amphibious changes they’d selected. Cor- 
vallis was close enough to the coast — even 
before the sea level had risen, there had been 
professors with beach homes — that there 
were others with the same modification 
package. The clinics were experienced. 

They found no problems. They shrugged 
and told the parents not to let her skin get 
too dry, not to use harsh perfumed lotions 
on the gill region. 

“Harsh perfumed lotions on a baby,” she 
snorted in the car. 

“Some people don’t know,’ he soothed. 

“She needs to ... she needs to be comfort- 
able with her heritage.” 

“Tl teach her to cook nopales so they 
don't get slimy,” he promised. 

This was not what she meant. 

The requisite long bath times were a 
struggle at first. Then it was eerie and upset- 
ting to find an eight-year-old lying perfectly 

still under the water 
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seemed like something. When it was just the 
child and the water, perfectly still, the laugh- 
ing selkie baby of imagination seemed never 
more distant. 

“Kids, said her mother. “You shouldn't 
think you know what you're getting, because 
you never, ever do. You were just like that, 
hours at a time.” 

“In the tub?” she said. “I never.” 

“No, up in that tree, the big maple on the 
side yard.” 

“That was different.” 

Her mother snorted. 


She was 22, and it was a long drive inland, 
parents and U-Haul and all. She had thought 
she would just go on her own, but her parents 
said there were no easy flights and the bus was 
a mess and... there were all sorts of reasons 
to take her, and once they were taking her, 
why, may as well take the old second-hand 
kitchen table theyd outgrown, some chairs, 
all the things they imagined shed need. 
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Theyd never expected shed go so far east, 
but once she‘ tasted fresh water, she was 
home, and there was no going back to the 
harsh tang of salt, the familiar and foreign 
greeting of sea stars and kelp. 

They helped her unpack, shelving her lim- 
nology textbooks in the duplex shed found; 
brick and white clapboard up in the steep 
hills, identical to all the other blue-collar 
buildings perched there, shocking them 
with its cheap rent. Theyd helped her stock 
up on all the food shed need to keep her 
weight up, keep her fat stores up to survive 
in the water, even with the wet suits specially 
cut around her gills. 

They wanted to tour the campus. They 
wanted to see the piers, the ore docks, the 

sand and black-stone beaches. They tried 
to talk about how the forests were like the 

forests back in Oregon, but they were 

not like. Nothing smelled like. The 
4G water, when they were gone, would 
= not taste like. 

When they drove off, she sat and 
cried, and her tears tasted treacher- 
ously, so unfairly, of salt. No matter 
how long she spent under the surface, 

spring through autumn, that wouldn't 
change. 

There would be others like her, in her 
graduate programme, others who were 
modified to be here, quiet and loath to 
speak, slow moving, large, glad of the cold, 
glad of the silences. They found pockets of 
zebra mussels. They fled the coasts. They 
analysed traffic. 

Their parents had no idea what to do with 
them. 

She walked down to the beach. Most of 
the people on it were unmodified, but there 
were enough like her that they were too 
polite to watch, used to it. She slipped into 
the waves, and they muffled her cries. 

Even in her own kind of water, she still cried 
out. She had never stopped crying out, with 
the change. But the freshness helped, sliding 
into the depth helped. The quiet helped. 

She crouched in the cool, cool corner and 
ate her fish, thinking about the shipping 
lanes and what to do next. 

She had never been able to explain to her 
mother that all the modification could do 
was make the transitions possible. It could 
never make them easy. = 


Marissa Lingen has published more than 
100 short stories in venues such as Analog, 
Lightspeed and Tor.com. 
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